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CHAPTER  ONE 


INTRODUCTION 

1.1.  Overview  and  Previous  Work 

Traditional  surveillance  and  communication  systems  use  centralized 
signal  processing  for  the  detection,  identification  and  tracking  of 
targets.  In  these  systems,  the  received  observations  are  transmitted 
to  the  central  processor  (detector)  where  classical  hypothesis  testing 
procedures  are  employed  for  signal  processing  [1,2].  The  hypothesis 
testing  procedures  can  be  classified  into  two  major  categories 
depending  on  the  test  duration  or  the  number  of  observations  to  be 
processed.  In  fixed-sample-size  (FSS)  hypothesis  testing  procedures  a 
fixed  number  of  observations  are  taken  before  the  test  can  be  executed. 
The  sample  size  is  determined  by  the  performance  level  to  be  achieved. 

A  variety  of  approaches  such  as  the  Bayesian  approach  and  the  Neyman- 
Pearson  approach  can  be  employed  in  FSS  detection.  Contrary  to  the  FSS 
procedures,  the  sequential  procedures  have  a  random  test  duration 
depending  on  the  actual  observations  and  the  test  thresholds.  These 
procedures  provide  a  significant  advantage  over  FSS  test  procedures  in 
terms  of  test  length.  For  specified  values  of  the  error  probabilities 
a  and  P  (where  a  and  P  are  the  error  probabilities  of  the  first  and 
second  kind,  i.e.,  probabilities  of  false  alarm  and  miss  respectively), 
and  known  probability  distributions,  the  sequential  procedures,  on  the 
average,  require  significantly  less  number  of  observations  than  the  FSS 
test  procedures  [3-5] .  The  sequential  probability  ratio  test  (SPRT)  of 
Wald  [1,5]  is  known  to  be  optimal  among  all  sequential  test  procedures. 
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Distributed  detection  systems  have  become  an  attractive  alternative  to 
the  convential  centralized  detection  systems  for  processing  large 
quantities  of  data  received  from  physically  distributed  sensors.  The 
decomposition  of  processing  is  essential  for  controlling  the  complexity  of 
data  processing  and  to  meet  the  needs  for  real-time  results.  Moreover, 
limited  communication  is  often  necessary  due  to  physical  bandwidth 
constraints  combined  with  the  desire  to  limit  or  reduce  the  effect  of 
possible  jamming,  thereby  increasing  the  system  survivability.  Distri¬ 
buted  processing  is  the  natural  way  to  handle  a  situation  in  which 
various  sensors  with  different  sensing  techniques,  such  as  sonic,  micro- 
wave  and  infra-red  sensors  are  employed.  These  decentralized  systems 
exhibit  the  advantages  of  higher  reliability,  survivability,  and  shorter 
decision  time  than  their  centralized  counterparts  [7,8]. 

Distributed  FSS  detection  systems  have  received  an  increasing  interest 
over  the  past  several  years.  Different  approaches  have  been  used  to 
analyze  these  systems.  Tenney  and  Sandell  [7]  introduced  distributed 
detection  using  a  fixed  fusion  center  and  used  the  Bayesian  approach  to 
optimize  the  local  detectors.  Chair  and  Varshr.ev  [9]  fixed  the  local 
detectors  and  U3ed  the  Bayesian  approach  to  optimize  the  fusion  center. 
Hoballah  and  Varshney  [10]  using  the  Bayesian  approach  combined  the 
results  in  [7,9]  and  optimized  the  entire  distributed  detection  system. 
Reibman  and  Nolte  [11]  showed  how  to  use  numerical  approaches  to  find 
globally  optimum  system  for  a  particular  detection  problem.  Srinivasan 
[12]  considered  the  Neyman-Pearson  criterion  for  system  optimization.  He 
fixed  the  fusion  center  and  optimized  the  local  detectors.  Hoballah  and 
Var3hney  [13]  jointly  optimized  the  entire  distributed  detection  system 
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using  the  Neyman-Pearson  criterion.  Thomopoulos  and  Okello  [14]  considered 
a  distributed  system  in  which  the  primary  detector  communicates  with  the 
consultant  detector  only  when  the  primary  detector  can  not  decide  on  one 
of  the  hypotheses.  The  majority  of  the  distributed  detection  systems 
[7-13]  allow  the  transmission  of  only  one-bit  decisions  (hard  decisions) 
from  local  detectors  to  the  fusion  center.  However,  some  distributed 
detection  systems  [15-17]  allow  the  transmission  of  multi-bit  local 
decisions  (soft  decisions)  to  the  fusion  center.  These  soft  decision 
distributed  detection  systems  have  the  advantage  of  improved  performance 
over  their  hard  decision  counterparts  at  the  expense  of  increased 
communication  (channel  capacity)  as  well  as  increased  analysis  and 
processing  complexity. 

The  decentralized  sequential  detection  problem  has  been  investigated 
in  [18-20],  In  [18],  Teneketzis  formulated  and  solved  a  decentralized 
version  of  the  Wald  problem  with  two  decision  makers.  In  his  model,  each 
detector  was  given  the  flexibility  of  either  stopping  and  making  a  de¬ 
cision  or  continuing  to  the  next  stage.  The  coupling  between  the  two 
local  detectors  was  introduced  through  a  common  cost  function  and  the 
objective  was  to  minimize  the  cost.  He  showed  that  the  person-by-person 
optimal  policies  of  the  local  detectors  are  described  by  thresholds  which 
are  coupled.  More  specifically,  the  thresholds  of  detector  1  at  any  stage 
depend  on  the  thresholds  of  detector  2  at  all  stages.  For  a  two-detector 
N-stage  detection  system,  he  showed  that  the  thresholds  are  determined  by 
solving  a  set  of  4N-2  nonlinear  algebraic  equations  in  4N-2  unknowns. 
Hashemi  and  Rhodes  [19]  examined  a  two-step,  two-detector,  sequential 
hypothesis  testing  problem  with  data  fusion  center.  They  also  discussed 
its  straightforward  extension  to  a  decentralized  multi-stage  sequential 
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detection  problem.  Their  model  is  different  from  the  one  examined  in 
[18].  In  [19],  the  sequential  test  is  carried  out  at  the  fusion  center 
and  local  detectors  have  no  control  over  the  termination  of  the  test. 
Chair  et  al.  [20]  obtained  a  decentralized  version  of  the  SPRT  for  the 
same  model  as  in  [18]  .  However,  they  used  the  Neyman-Pearson  approach 
for  the  solution  of  the  problem.  In  [18-20],  the  computation  of  thresh¬ 
olds  was  quite  complicated  due  to  the  fact  that  the  thresholds  are 
coupled  at  all  stages .  Moreover,  the  emphasis  was  solely  on  the  computa¬ 
tion  of  the  optimal  thresholds.  The  performance  of  the  schemes  was 
totally  ignored.  Therefore,  it  is  desirable  to  explore  alternate  struc¬ 
tures  and  schemes  for  distributed  sequential  detection  where  the  thresh¬ 
olds  are  easier  to  obtain,  and  the  performance  of  the  schemes  can  be 
evaluated. 

In  this  report,  we  consider  alternate  sequential  detection  schemes 
and  evaluate  their  performance.  The  sequential  procedures  presented 
here  are  generalizations  of  the  classical  sequential  procedures  11,21- 
27]  to  a  distributed  environment.  Two  important  functions  frequently 
used  to  evaluate  the  performance  of  an  SPRT,  namely  the  average  sample 
number  (ASN)  function,  and  the  operating  characteristic  (OC)  function, 
are  evaluated  for  the  various  schemes  considered.  These  schemes  have 
the  desired  advantages  of  distributed  detection  system  over  their 
centralized  counterparts,  and,  in  addition,  their  performance  is  easier 
to  evaluate  than  the  schemes  in  [18-20] . 

1.2,  Report  Organization 

In  thi3  report,  we  consider  some  distributed  sequential  detection 
systems  and  evaluate  their  performance.  Most  of  the  distributed 
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systems  considered  here  are  parametric  where  an  exact  knowledge  of  the 
observation  statistics  is  assumed.  The  only  exception  is  the  non- 
parametric  distributed  system  considered  in  Chapter  Seven,  where  the 
observation  statistics  are  not  known  completely. 

In  Chapter  Two,  we  describe  the  sequential  probability  ratio  test 
(SPRT)  of  Wald  [1],  Concepts  such  as  average  sample  number  (ASN)  func¬ 
tion  and  operating  characteristic  (OC)  function,  which  are  essential  in 
evaluating  the  performance  of  sequential  test  are  introduced  in  some  de¬ 
tail.  Alternate  approaches  to  obtain  the  exact  expressions  (.28,29]  or 
the  OC  and  ASN  functions  are  also  described. 

In  Chapter  Three,  an  SPRT  based  on  quantized  multi -censor  observa¬ 
tions  is  described  and  its  performance  is  evaluated.  It  is  shown  that 
for  the  same  level  of  performance,  the  ASN  decreases  monotonically  by 
increasing  the  number  of  local  sensors.  The  issue  of  optimal  quanti¬ 
zation  for  sequential  detection  is  studied,  and  the  optimal  quantizer  is 
shown  to  be  a  likelihood  ratio  quantizer.  Moreover,  we  show  that  the 
quantizers  obtained  as  a  result  of  a  joint  optimization  are  the  same  as 
those  obtained  when  each  local  quantizer  is  optimized  independently.  The 
specified  performance  level  in  terms  of  error  probabilities  is  shown  to 
be  achievable  despite  channel  errors  at  the  expense  of  increased 
ASN's.  An  example  is  presented  to  illustrate  the  results  obtained  in 
this  chapter. 

In  Chapter  Four,  we  propose  and  analyze  a  simple  sequential  detection 
scheme  using  multiple  sensors.  The  performance  expressed  in  terms  of  the 
ASN  is  shown  to  improve  monotonically  with  the  number  of  local  detectors 
(sensors)  used.  Moreover,  we  show  that  when  truncated,  the  sequential 
test  based  on  multiple  sensors  has  a  smaller  increase  in  error  proba- 
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bilities  than  its  single  detector  counterpart.  A  numerical  example  is 
presented  to  illustrate  the  results  of  this  chapter. 

In  Chapter  Five,  we  generalize  the  SPRT  of  Wald  to  a  distributed 
system  consisting  of  two  local  detectors  and  a  global  decision  maker 
(GDM) .  Each  local  detector  performs  an  SPRT  based  on  its  own  observations 
and  communicates  its  local  decision  to  the  GDM.  The  GDM  combines  the 
local  decisions  according  to  a  predetermined  fusion  rule,  and  decides 
either  to  stop  and  accept  one  of  the  two  hypotheses,  or  to  continue.  The 
global  error  probabilities  are  shown  to  be  functions  of  the  local  error 
probabilities  and  the  fusion  rule.  The  global  test  duration  is  obtained 
for  various  possible  fusion  rules  and  its  average  is  derived.  An  example 
is  presented  for  the  case  of  two  identical  local  detectors. 

In  Chapter  Six,  a  modified  sequential  detection  procedure  is  proposed 
and  analyzed.  The  structure  of  the  distributed  system  is  the  same  as  that 
considered  in  Chapter  Five.  Each  local  detector  takes  a  group  of  Nq 
observations,  performs  a  likelihood  ratio  test,  and  decides  either  in 
favor  of  one  of  the  hypotheses  or  declares  its  inability  to  decide.  The 
local  decisions  (including  no  decision)  are  transmitted  to  the  GDM  which 
combines  the  local  decisions  according  to  a  predetermined  fusion  rule,  and 
decides  either  to  stop  the  local  tests  and  accept  one  of  the  hypotheses  or 
to  continue.  When  the  global  decision  is  to  continue,  the  detection 
process  is  repeated  ignoring  all  previous  stages.  The  binary  and  M-ary 
hypothesis  testing  problems  are  considered.  A  truncation  scheme  to 
limit  the  number  of  observations  from  being  excessively  large  is  proposed 
and  analyzed.  Numerical  results  are  presented  to  illustrate  the 
performance  of  the  system. 

In  Chapter  Seven,  the  nonparametric  sequential  conditional  sign  test 
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of  Shin  and  Kassam  [22]  is  first  studied  and  modified.  The  modified  test 


has  fixed  thresholds  that  are  independent  of  the  observations  and,  there¬ 
fore,  it  does  not  require  the  table  look-up  operation.  Both  the  sign  and 
conditional  sign  nonparametric  sequential  tests  [22]  are  generalized  to  a 
distributed  system  of  two  local  sensors  (or  detectors) .  The  resulcing 
tests  are  shown  to  maintain  the  desirable  nonparametric  property.  A 
numerical  example  is  presented  for  illustration. 

In  Chapter  Eight,  a  summary  of  results,  conclusions,  and  suggestions 
for  future  research  are  presented. 
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CHAPTER  TWO 


PRELIMINARIES 

In  this  chapter,  we  briefly  review  Wald's  sequential  probability 
ratio  test  (SPRT) .  Some  terminology  and  notation  are  introduced.  The 
modeling  of  SPRT  as  a  finite  state  Markov  chain  is  also  discussed. 

2,1.  Wald's  Sequential  Probability  Ratio  Test 

Consider  the  problem  of  testing  a  simple  hypothesis  versus  a  simple 
alternative  H^.  Let  f(x,9)  denote  the  probability  density  function  of 
the  random  variable  x  representing  the  observation  (sample)  under 
consideration.  Let  HQ  be  the  hypothesis  that  9  =  9Q,  and  the  hypo¬ 
thesis  that  9  =  9^.  Thus,  the  distribution  of  x  is  given  by  f(x,9_.)  when 
H j  is  the  true  hypothesis  , j=0,l.  The  successive  observations  on  x  are 
denoted  by  x^, x^, . . . x^ (n  >  1)  and  they  are  assumed  to  be  statistically 
independent  and  identically  distributed  (iid) .  For  any  positive  integer 
n,  the  conditional  probability  density  function  of  given  H_.  is  given 
by 


f x  /H. 
~n  j 


3=0,1  (2.1) 


where  x  =[x  x  . . .x  ]  is  the  observation  vector. 

— n  12  n 

Wald's  SPRT  for  testing  against  is  defined  as  follows:  Two 

positive  constants  t  and  t0 (t  >  t„)  are  chosen.  At  each  stage  of 

u  i  u  £  r 

testing  (n,  n  >  1) ,  the  likelihood  ratio  function  A  defined  below  is 

n 

computed. 
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A 

n 


(2.2) 


=  f  /f 

VH1  VH0 


The  likelihood  ratio  function  in  (2.2)  is  compared  to  the  thresholds  t 
and  tj,  as  follows: 


A 

n 


>  t  ,  stop  and  decide  H„ 
u  1 

'  <  t^  ,  stop  and  decide 
•  otherwise,  continue 


(2.3) 


The  choice  of  the  test  thresholds  depends  on  the  desired  values  of  the 
error  probabilities  a  and  (5,  where  a  is  the  probability  of  deciding 
when  Hq  is  true,  and  [3  is  the  probability  of  deciding  when  is  the 
true  hypothesis.  It  has  been  shown  [1]  that  the  approximation  of  the 
thresholds 


t  2  (1  -  P)/o 

u 


(2.4) 


tt  =  p/(l  -  a) 

is  sufficiently  accurate  for  all  practical  purposes  and,  consequently,  it 
i3  widely  used  in  the  literature. 

The  monotonicity  of  the  logarithm  function  enables  us  to  derive  a 
more  convenient  test  by  simply  taking  the  logarithm  of  (2.3).  The  re¬ 
sulting  test  is  given  by 


(£  log  t  ,  stop  and  decide 

<  log  t^  ,  stop  and  decide  HQ  (2.5) 

otherwise,  continue 


Taking  the  logarithm  of  the  likelihood  function  in  (2.2),  we  obtain 


n 

log  A^  =  y~  log 
i=l 


f  (x^G^) 

f<V9o> 


(2.6) 


9 


(2.7) 


Denoting  the  ith  term  of  the  sum  in  (2.6)  by  Z^,  i.e., 


Z. 

1 


fui-9i 

fu.,e0 


) 
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the  test  procedure  can  be  described  as  follows:  At  each  stage  of  the 
test  (n,  n  >  1) ,  the  sum  Z^  +  Z^  +  ...  +  Z^  is  computed,  and  the  follow¬ 
ing  test  is  performed 

>  log  t  ,  stop  and  decide  H, 

Z .  \  log  t.  ,  stop  and  decide  H  (2.8) 

£=1  1  I  .  0 

'  otherwise,  continue 

The  performance  of  an  SPRT  is  characterized  in  terms  of  the  operating 
characteristic  (OC)  and  average  sample  number  (ASN)  functions.  These  are 
discussed  next . 


2.2.  The  OC  Function  of  the  SPRT 

The  operating  characteristic  function  L(0)  of  an  SPRT  is  defined  as 

the  probability  that  the  sequential  testing  procedure  will  terminate  with 

the  acceptance  of  the  hypothesis  when  9  is  the  actual  value  of  the 

parameter.  This  is  necessary  in  situations  where  the  exact  knowledge  of 

the  parameter (s)  of  the  distribution  function  is  not  available.  An 

approximate  formula  for  L(9),  neglecting  the  excess  of  the  likelihood 

function  A  over  the  test  thresholds  t  and  t„  at  the  termination  stage 
n  u  t 

is  given  by  [1] 


L(0) 


h  (0) 
c 

u 

h  (0) 
u 


-  1 


,h  (0) 
l 


(2.9) 
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where  h ( 9 )  is  the  unique  nonzero  solution  of  the  equation 


CO 

f(x,01)‘ 

f  <x/e0> 

h  (0) 


f (x,9)  dx  =  1 


(2.10a) 


when  x  is  a  continuous  random  variable,  and  h(8)  is  the  unique  nonzero 
solution  of  the  equation 


L 


f (X, 01> 


f  <X'0Q> 


h(0) 


f (x, 6)  =  1 


(2.10b) 


when  x  is  a  discrete  random  variable.  The  summation  in  (2.10b)  is 
clearly  over  all  possible  values  of  x. 

From  (2.10a)  and  (2.10b),  it  is  obvious  that  h(0g)=l  and  h(0  )=-l. 
Therefore,  it  follows  from  (2.9)  that  for  t^  and  t^  as  given  by  (2.4),  we 
have  L (0 Q )  =  1-a,  and  1,(0^)  =  P  as  required. 


2.3.  The  ASN  Function  of  the  SPRT 

Let  n  denoue  the  number  of  observations  required  for  the  termination 
of  the  sequential  test,  and  let  Eg (n)  denote  the  expected  value  of  n  when 
0  is  the  true  value  of  the  parameter.  E0(n)  is  a  function  of  9  and  is  de- 
fined  co  be  the  average  sample  number  (ASN) .  Following  Wald  [1]  in  neg¬ 
lecting  the  excess  of  A  over  the  thresholds  at  the  termination  stage,  it 

n 

can  be  shown  that  Eg (n)  is  given  by 


E0(n)  ^ 


L (9)  log  t.  +  (1-  L (0) ]  log  t 

_ t _  U 

~  vv 


,  EgtZJ  *  0  (2.11) 


wnere  Z.  is  as  defined  by  equation  (2.7),  and  Ea(Z.)  is  given  by 
l  U  i 


OO 

'f  (x,©^)' 

w  = 

log 

f(x,0o) 

—  OO 


f(x,8)  dx 


(2.12) 
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when  x  is  a  continuous  random  variable.  When  x  is  a  discrete  random 
variable,  equation  (2.12)  becomes 


W  =  H  l0g 

X 


'f  (x,e1)- 
f <x,8Q) 


f (x,8) 


(2.13) 


When  Ea(Z.)  =  0,  we  denote  9'  the  value  of  9  such  that  Ert(Z.)  =0.  In  this 
Hi  Hi 

case  (1],  the  expression  for  Eg  ,  (n)  is  given  by 


(n)  = 


log  t£  •  log  t^ 
E9'(Z?i> 


(2.14) 


and  the  corresponding  OC  function  is  given  by 


L  (8 ' ) 


tt 


log  t 

_  u _ 

log  tu  -  log  t£ 


(2.15) 


It  should  be  emphasized  that  the  expressions  for  the  OC  and  ASN  func¬ 
tions  as  given  by  (2.9)  and  (2.11)  respectively,  are  approximate  in  that 
they  were  derived  under  the  assumption  of  no  excess  over  the  test  thresh¬ 
olds.  It  is  possible  to  derive  the  exact  expressions  for  the  OC  and  ASN 
functions  only  when  the  random  variable  Z^  defined  by  (2.7)  can  take  a 
finite  number  of  values  which  are  integral  multiples  of  a  constant. 

Using  the  characteristic  function  of  the  test  duration,  Wald  [1]  derived 
expressions  for  the  probability  distribution  of  the  test  duration  as  well 
as  exact  expressions  for  the  OC  and  ASN  functions.  The  solution  of 
Wald's  equations  has  been  simplified  by  Girshick  [30]  who  reduced  the 
solution  to  that  of  solving  a  set  of  simultaneous  equations.  In  the  re¬ 
mainder  of  this  chapter,  we  present  a  totally  different  approach  to  the 
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computation  of  the  exact  values  of  the  OC  and  ASN  functions  as  well  as 


the  exact  distribution  of  the  test  length  (duration) .  The  approach  in 
the  next  section  is  based  on  a  formulation  of  the  sequential  test  as  a 
random  walk  on  a  finite  number  of  states  using  the  transition  probability 
matrix. 

2.4.  The  Transition  Probability  Matrix  Formulation  of  the 
Sequential  Test 

In  this  section,  we  describe  an  alternate  approach  for  the  evalu¬ 
ation  of  the  exact  distribution  of  the  sequential  test  length  as  well  as 
the  corresponding  OC  and  ASN  functions.  The  approach  requires  that  the 
step  as  defined  by  (2.7)  be  finite-valued  and  takes  values  that 
are  integral  multiples  of  a  constant.  This  facilitates  the  formulation 
of  the  SPRT  as  a  random  walk  (Markov  chain)  on  a  finite  number  of  states 
and  its  complete  description  by  the  specification  of  its  transition  proba 
bility  matrix.  Proakis  [28]  described  the  random  walk  formulation 
approach  and  derived  the  exact  distribution  of  the  test  length  T  and  the 
ASN  for  quantized  radar  signals.  The  same  approach  is  described  in  more 
detail  in  [29]  which  is  the  basis  for  the  material  presented  here. 

Consider  a  Markov  chain  with  a  state  space  made  up  of  (N+l)  states 

given  by  0,1,2,...,N.  Let  S  denote  the  state  at  stage  n  and  pf^  denote 

n  13 

the  probability  of  being  in  state  j  after  n  transitions  starting  in  state 
i,  i.e., 

Pij)  =  Pr  {Sn  =  j/S0  =  i}  (2-16> 

States  0,1,..., r-1  are  transient  in  that  lim  P  =  0  for  0<i,j<r-l, 

n-*»  i  j 

while  states  r,r+l,...,N  are  absorbing,  and  P  =  1  for  r<i<N.  As  n 


13 


approaches  the  process  will  ultimately  be  absorbed  in  one  of  the 
absorbing  states. 

The  stochastic  process  is  described  by  the  transition  probability 

matrix  whose  (ij)th  element  is  pfV  =  P.  .  and  which  can  be  represented  in 

13  13 

the  partitioned  form  as  follows 


P  = 


Q 

10 


IJ 


(2.17) 


where  £  is  an  (N-r+1)  x  r  matrix  all  of  whose  components  are  zero,  I_  is 

an  (N-r+1)  x  (N-r+1)  identity  matrix  and  Q. .  =  P. .  for  0<i,  j  <  r-1. 

13  13 

A  straightforward  matrix  multiplication  shows  that  the  nth  power  of  P 
is  given  by 


Pn  = 


Qn  (I+Q+. .  .+Qn_1)R' 


(2. IS, 


L  0 


Let  Wn  =  l  +  Q  +...+  Qn,upon  rewriting  wn,  we  obtain 


Wn=I+Q(I+Q+..,+  Q11"1) 


=  I  +  Q  W 


,n-l 


In  the  limit,  we  have 


W  =  lim  W 

—  — 


Combining  (2.19)  and  (2.20),  we  arrive  at 

W  =  I  +  Q  W 

which  can  be  written  as 


(2.19) 


(2.20) 


(2.21) 


(I  -  Q)W  =■  1  (2.22) 

From  (2.22),  we  observe  that  W  =  ( 3>Q)  the  inverse  of  (£-Q) .  The 
matrix  W  is  called  the  fundamental  matrix  associated  with  Q.  The  ASN  is 
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obtained  by  counting  the  number  of  visits  to  all  transient  states  knowing 
that  the  process  started  in  the  ith  transient  state.  In  other  words,  we 
have 

r-1 

ASN  =  Y~  W. .  ,  0  <  i  <  r-1  (2.23) 

k  13 

Turning  to  the  termination  probabilities,  i.e.,  the  probabilities  with 
which  the  process  will  be  absorbed  in  one  of  the  absorbing  states. 

Recall  that  the  states  k=r,  ...,tJ  are  absorbing.  Since  such  a  state 
cannot  be  exited  once  entered,  the  probability  of  absorption  in  a 
particular  absorbing  state  k  up  to  time  n  and  starting  from  initial  state 
i,  is  simply 

P.(f°  =  Pr{ S  =  k/S.  =  i}  ,  0  <  i  <  r-1  and  r  <  k  <  N  (2.24) 

lk  n  0 

In  terms  of  the  test  length  T,  we  can  write 

=  Pr{T  <  n  and  ST  =  k/SQ  =  i}  , 0  <  x  <  r-1  and  r  <  k  <  N  (2.25) 

where  T  =  min  {n  £  0:  r  £  S  <  N}  is  the  termination  time.  Let 

n 

Uik>  =  Pr^T  ~  n  and  ST  =  k/S0  =  i^'  0  -  1  -  r"1  and  r  <  k  <  N  (2.26) 

Referring  to  (2.18),  (2.24)  and  (2.25),  we  give  the  matrix  un  by 

Un  =  (I  +  Q  +  ...  +  Qn_1)R  (2.27) 

which  can  be  simplified  using  (2.19)  to  obtain 

Un  =*  wn_1R  (2.28) 

Taking  the  limit  as  n  ->  <*>,  we  obtain  the  absorption  probability  matrix 
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U  in  terms  of  the  fundamental  matrix  W  as  simply  U  =  W  R,  or 

r-1 

U  =  V"  W,  .  R  ,  0  <  i  <  r-1  and  r  <  k  S  N  (2.29) 

1K  ^ 

It  should  be  emphasized  that  equation  (2.29)  gives  the  exact  value  of  the 
termination  probability  to  state  k  starting  from  state  i,  and  therefore,  it 
can  be  used  to  ensure  the  specified  error  probabilities..  On  the  other 
hand,  (2.26)  through  (2.28)  give  the  cumulative  probability  of  termina- 
t ' on  in  state  k  starting  from  state  i.  Therefore,  summing  over  all  k,  we 
obtain 

N 

Pr{ T  £  n}  =  y  uy  (2.30) 

k~ r 

which  gives  the  cumulative  probability  distribution  of  the  test  length 
given  that  the  process  started  in  state  i,  i.e.,  SQ  =  i. 

In  the  above  analysis,  we  assume  that  the  transition  probability 
matrix  is  given  and  obtain  the  ASN  and  termination  probabilities. 

However,  while  applying  the  theory  to  the  sequential  hypothesis  testing 
problem,  it  is  clear  that  we  have  to  deal  with  two  transition  probability 
matrices  each  corresponding  to  one  of  the  two  hypotheses.  Moreover,  the 
set  of  absorbing  states  has  to  be  partitioned  into  two  groups  each 
corresponding  to  a  decision  in  favor  of  one  of  the  hypotheses.  The  OC 
function  can  be  easily  found  by  summing  the  probabilities  of  termination 
in  all  the  states  assigned  to  decision  provided  that  the  transition 
probability  matrix  which  represents  the  true  parameters  is  used. 

This  formulation  will  be  used  in  the  later  chapters  to  illustrate  the 
concepts  developed. 
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CHAPTER  THREE 


A  SEQUENTIAL  PROBABILITY  RATIO  TEST  BASED 
ON  MULTI SENSOR  DATA 

3.1.  Introduction 

In  binary  hypothesis  testing,  it  is  well  known  that  sequential  test 
procedures  provide  a  significant  advantage  over  fixed-sample  size  (FSS) 
test  procedures  [1,3-5].  For  prespecified  values  of  the  error  probabil¬ 
ities  a  and  P  (where  a  is  the  probability  of  error  of  the  first  kind  and 
p  is  the  probability  of  error  of  the  second  kind) ,  the  sequential 
procedures,  on  the  average,  require  a  substantially  less  number  of  obser¬ 
vations  (samples)  than  FSS  test  procedures.  The  sequential  probability 
ratio  test  (SPRT)  of  Wald  [1]  is  known  to  be  optimal  among  all  possible 
sequential  test  procedures. 

In  this  chapter,  we  generalize  Wald’s  SPRT  to  a  distributed  system 

consisting  of  M  local  sensors  and  a  global  (central)  decision  maker  as 

shown  in  Fig.  3.1.  In  Section  3.2,  we  define  the  observation  model, 

formulate  the  centralized  SPRT  based  on  multisensor  data,  and  derive  an 

expression  for  the  global  average  sample  number  ASN_.  when  the  hypothesis 

H.,  j=0,l,  is  true.  Moreover,  we  show  that  ASN4  is  a  monotonically 
3  j 

decreasing  function  of  the  number  of  local  sensors  used.  In  most 
multisensor  detection  systems,  a  bandwidth  constraint  on  the  com¬ 
munication  channels  carrying  data  from  local  sensors  to  the  global  de¬ 
cision  maker  (GDM)  is  assumed.  Therefore,  it  is  often  necessary  to  com¬ 
press  the  data  locally  prior  to  transmission.  In  Section  3.3,  we  quantize 
each  of  the  local  observations  into  two  levels  and  transmit  the  quantized 
value  to  the  GDM  for  further  processing.  The  issue  of  optimal 


17 


Fig.  3.1:  A  distributed  system  consisting  of  M  local  sensors 
and  a  global  decision  maker. 
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quantization  is  studied  in  some  detail,  and  it  is  shown  that  a  globally 
optimum  design  of  the  quantizers  is  obtained  by  optimizing  the  individual 
local  quantizers  independently.  Also,  it  is  shown  that  an  optimal  local 
quantizer  is  based  upon  the  quantization  of  the  local  likelihood  ratio 
(LR) .  In  Section  3.4,  we  show  that  ASN^  is  a  monotonically  decreasing 
function  of  the  number  of  bits  used  for  local  quantization.  In  Section 
3.5,  we  consider  a  distributed  system  consisting  of  two  identical  local 
sensors.  The  SPRT  at  the  GDM  is  formulated  as  a  random  walk  [28,29]  on  a 
finite  state  Markov  chain  and  the  effect  of  transmission  errors  is  inves¬ 


tigated.  Two  cases  are  studied.  In  the  first  case,  it  is  shown  that  when 
the  transmission  errors  are  neglected,  the  result  is  an  increase  in  the 
error  probabilities  over  their  specified  values.  It  is  shown  in  the 
second  case  that  when  the  transmission  errors  are  incorporated  in  the  de¬ 
sign,  the  specified  error  probabilities  are  satisfied  at  the  expense  of  an 


increased  ASN_.,  j=0,l.  In  Section  3.6,  we  present  some  numerical  re¬ 
sults  for  the  system  described  in  Section  3.5.  Finally,  the  results 


obtained  in  this  chapter  are  discussed  in  Section  3.7. 


3.2.  A  Centralized  SPRT  for  Multisensor  Data 

Consider  the  multisensor  distributed  system  shown  in  Fig.  3.1  which 
consists  of  M  local  sensors  and  a  global  decision  maker  (GDM) .  The 
problem  under  consideration  is  that  of  testing  a  simple  hypothesis 
versus  a  simple  alternative  .  Let  f(x^,9j  denote  the  probability  den¬ 
sity  function  (pdf)  of  the  random  variable  X^,  i=l,2,...,M,  which  repre¬ 
sents  the  observation  at  the  ith  local  sensor.  Let  be  the  hypothesis 
that  9^  =  9^,  and  the  hypothesis  that  9^  =  9^.  Therefore,  the  pdf  of 

X.  is  given  by  f(x.,9. ,)  wher  H.,  i =0 , 1 ,  is  true.  The  successive 
i  iij  j  J 
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observations  on  X,  are  denoted  by  x.„,  x._, (n  >  1)  and  they  are 
i  ll  i2  m 

assumed  to  be  iid.  The  observations  are  also  assumed  to  be  independent 

from  one  sensor  to  the  other,  i.e.,  x.  is  independent  of  x.,  (m  *  k) ,  and 

1m  i)c 

x„  (£  *  i) .  Each  local  sensor  is  assumed  to  process  its  own  observa- 
tm 

tion(s)  and  transmit  the  analog  value  of  the  local  test  statistic  to  the 

GDM.  At  any  observation  time  n(n  >  1),  the  GDM  computes  the  central 

likelihood  ratio  (CLR)  function  A  as  follows 

cn 


A 

cn 


A 


M 

IT 


IT 

k=l 


flxik-  6ii»i 


£Uik'  Vj 


(3.1) 


where  f  (x , ,  ,  8 .  . ) 

lk  i  j 


f  (x. 


0.  .) 

ID 


Equation  (3,1)  can  be  written  as 


A 

cn 


M 

IT 

i=l 


f  n-l  f  (x  ,  8  )  f  (x.  ,  0  ) 

TT  xk  il  m  il 


Lk-l  £(>tik'  0iO> 


f(x.  ,  0..) 
m  iO 


(3.2) 


which  is  recognized  as  the  product  of  the  CLR  at  the  (n-l)th  stage 

A  ,  , ,  and  the  nth  increment  A°  ,  where  An  is  given  by 
c (n-l)  c  c  *  J 


Hence 


cn 


M 

TT 

i=l 

f  (x.  f 
m 

eil> 

t  (x.  , 
m 

9i0> 

Ac (n-l) 

•  A 

c 

compares  A 

with 

(3.3) 


and  B  as  follows 


cn 


>  A  ,  stop  and  decide  H ^ 
<  B  ,  stop  and  decide  H 
otherwise  ,  continue 


(3.4) 


where  A 


and  B 


*  JL 

l-a 


are  the  test  thresholds . 


From  the  monoto- 
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nicity  of  the  logarithm  function  and  the  fact  that  A  as  well  as  A 

cn 

and  B  are  always  positive,  a  more  convenient  test  is  derived  by  taking 
the  logarithm  of  both  sides  of  (3.4)  to  obtain 


log  A 


cn 


>  log  A  , stop  and  decide  H 
‘  <,  log  B  ,  stop  and  decide 
■otherwise,  continue 


(3.5) 


Taking  the  logarithm  of  the  CLR  in  (3.2),  we  obtain 


log  A  =  ) 

cn  I— 

i=l 


n-1  f(x  ,  6.)  f(x.  ,  0  ) 

,  ik  il  xn  il 

)  lQg  ttz - 5 — r  +  lQg 


k=l 


£<*ik'  9i0> 


Uxin'  Vj 


(3.6) 


f (x.  ,  0  ) 

Let  2  ■  V  =  lo9  f  1 - ft1  — 

lk  £Uik'  9i0 


, i=l, 2, . . . ,M;  kS  1,  to  obtain 


M 


log  A  =  Y~ 
cn  L— 

i=l 


r  "f-1 

)  2..  +  Z. 


k=l 


M  n-1  M 

r  r  »lk  ♦  r «, 


m 


(3.7) 


i=l  k=l 


i=l 


Equation  (3.7)  shows  that  the  local  test  statistic,  k  >  1,  is  simply 

the  logarithm  of  the  local  LR  function.  In  other  words,  at  any  stage  of 

the  test  each  local  sensor  computes  the  logarithm  of  its  LR  function 

based  on  the  current  observation  and  transmits  the  result  to  the  GDM 

The  GDM  employs  the  received  local  test  statistics  to  update  its  test 

statistic  log  A  according  to  (3.7).  Because  the  successive  observa- 
cn 

tions  at  the  ith  sensor,  i=l,2,...,M,  are  iid  random  variables,  it 
follows  that  Z^,  k  >  1,  is  also  a  sequence  of  iid  random  variables. 

Assuming  no  excess  over  the  test  thresholds  [1],  it  follows  that  when 
H.  is  true  and  the  test  terminates,  we  have 


21 


log  A  = 
cn 


log  A  ,  with  probability  p , 


log  B  ,  with  probability  ( 1  — p ) 


(3.8) 


ire  pQ  =  a  ana  p  s  { 1 — P ) .  Equating  the  expected  values  of  both  sides 


of  (3.8),  we  obtain 


n  M 


a  r  .  jLi  ^ 

E [log  A  ]  =  E  y  Y  Z .  =  p.  log  A  +  (1-p.)  log  B  (3.9) 

cn  L  i^l  1  J  3  3 


n  M 

However,  )  Y  2  is  a  random  sum  of  iid  random  variables.  Therefore, 
its  expected  value  is  known  t29]  to  be 


n  M 


E  [Y  V  Zik  ]  =  E[n]  *  E[  Y  ZJ  =  E[n^  •  Y  (3.10) 


Therefore,  ASN_.  =  E(n/n^]  is  given  by 


p.  log  A  +  (1-p.)  log  B 

ASN  =  -1— R - ] -  ,  j-0,1 

3  .1  EtZ./H.) 

i=l  i  1 


(3.11) 


Since  ASN_.,  j=0 , 1 ,  is  a  positive  real  number,  it  follows  that  the  denomi¬ 
nator  and  numerator  in  (3.11)  must  have  the  same  sign.  Moreover,  it  is 
clear  that  if  the  ith  sensor  is  used  alone,  then  the  denominator  in  (3.11) 
i3  E(Z./H.],  i=l,2,..,,M.  Therefore,  E[ZVH_J  must  have  the  same  sign  as 
the  numerator  of  (3.11)  provided  that  the  ith  sensor  observations  are 
sufficient  for  detection,  i.e.,  the  ith  sensor  is  reliable  for  detection. 
The  reciprocal  of  ASN^  in  (3.11)  is  given  by 


(ASN  .) 
3 


.E.  E1Z./H.) 

_ ^  1  3 _ 

P.  log  A  +  (1-p.)  log  B 
3  3 


,  j=0,l 


(3.12) 
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(3.13) 


which  shows  that  (ASN,)  ,  the  inverse  of  ASN_.  is  a  monotonically  in¬ 


creasing  function  of  the  number  of  local  sensors.  Consequently,  it 


follows  that  ASN_.  is  a  monotonically  decreasing  function  of  the  number  of 
local  sensors  used. 


The  centralized  SPRT  considered  above  is  optimal  in  the  sense  that  no 
quantization  was  employed.  However,  its  implementation  requires  an  error 
free  transmission  of  analog  observations  or  their  analog  sufficient  sta¬ 
tistics  which  is  practically  impossible  to  achieve.  Therefore,  we  focus 
our  attention  in  the  next  section  on  the  case  when  each  local  sensor  quan¬ 
tizes  each  of  its  individual  observations  into  a  binary-valued  variable 
prior  to  transmission. 


3.3.  The  Multisensor  Centralized  SPRT  with  Quantized  Data 

In  the  previous  section,  it  was  assumed  that  each  local  sensor  is 
capable  of  computing  the  likelihood  ratio  function  at  its  location  and 
transmitting  the  exact  computed  value  to  the  central  processor  exactly. 

In  this  section,  we  assume  that  each  .local  sensor  quantizes  its  individual 
observations  into  two  distinct  levels  (one  bit  quantizer),  namely  0  and  1. 
The  quantized  local  observations  are  communicated  to  the  GDM  where  a 
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centralized  SPRT  is  performed.  The  binary  quantization  of  the  local 
observations  greatly  reduces  the  communication  channel  bandwidth 
required.  In  addition,  it  simplifies  the  implementation  of  the  SPRT  at 
the  GDM . 


Consider  the  system  shown  in  Fig.  3.2,  which  consists  of  M  local 
sensors  followed  by  M  local  quantizers.  The  analog  observations  are  as 
defined  earlier  in  Section  3.2.  The  ith  quantizer  maps  the  successive 
observations  of  the  ith  local  sensor  into  a  sequence  of  0's  and  l's. 
Therefore,  the  hypothesis  testing  problem  is  given  by 


yik 


B(PiO> 


i=l, 2, . . . , ,  and  k  >  1 


(3.15) 


H1  :  yik  S  B(pil> 


where  y^^  is  the  random  variable  representing  the  output  of  corre¬ 
sponding  to  the  kth  observation.  B(p. .)  is  a  Bernoulli  random  variable 

with  probability  of  success,  i.e.,  y.,  =  1,  equal  to  p. .  when  H.  is  true. 

lk  l  j  j 

The  probability  p_  is  given  by 


P 


ij 


f (x . ,  0 . . )  dx. 
i  x  j  i 


(3.16) 


where  R^  is  the  region  of  the  observation  space  in  which  y^  is  assigned 

the  value  1.  Clearly,  the  design  of  the  ith  quantizer  involves  a 

unique  determination  of  the  region  R, . . 

li 

The  centralized  SPRT  is  performed  at  the  central  processor  in  a 

manner  identical  to  that  described  in  Section  3.2.  The  only  difference 

is  that  Z.,  =  Z.  is  now  a  discrete  two-valued  random  variable  whose  dis- 
lk  i 

tribution  when  H.  is  true  is  given  by 
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Fig.  3.2:  A  distributed  system  consisting  of  M  local  sensors 

followed  by  M  local  quantizers  and  a  global  decision  maker. 
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2ik  =  < 


log [  (p^)  /  (p^)  ]  ,  with  probability  p 


log  {  ( 1-p  ,  /  (  I'P^q)  3  /  with  probability  1-p 


(3.17) 


From  (3.17),  we  observe  that  the  operation  of  the  central  processor  can 

M 

be  described  in  terms  of  a  finite  number  of  possible  steps  ZJ  , 

M  M 

i.e.,  there  are  2  possible  steps  corresponding  to  the  2  possible  combi¬ 
nations  of  the  local  observations.  The  values  of  these  possible  steps 
are  known  and  can  be  stored  in  a  read-only-memory.  To  derive  expressions 
for  the  ASN's  we  follow  the  same  procedure  as  used  in  deriving  equation 
(3.11).  For  the  quantized  observations  case,  (3.11)  can  be  written  as 
follows 


ASN.  = 


P  ■  log 

JL  « 


1-3  3 

— &  +  (1-p^)  log 


3  Jl  [  pij  logt(Pi;L)/(Pi0)]+  U-p..)  log[(l-p.1)/(l-p.0)]_ 


(3.18) 


It  should  be  emphasized  that  (3.18)  is  derived  under  the  assumption  of  no 
excess  over  the  test  thresholds  for  given  quantizers. 

Next,  we  address  the  issue  of  optimal  design  of  the  quantizers  Q  , 

^2'  '  and  show  that  the  optimal  quantization  is  obtained  by  optimiz¬ 

ing  the  local  quantizers  independently.  To  this  end,  let  denote  the 
denominator  of  (3.18)  when  H.  is  true,  and  let  D. ,  denote  the 
contribution  of  the  ith  quantizer  to  D . .  Therefore,  we  can  write 


n 

~  j 


D.  . 
13 


(3.19) 


Note  that  the  numerator  of  (3.18)  is  fixed.  Therefore,  in  order  to 
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minimize  ASN^,  must  be  maximized.  However,  maximization  of  the  sum  of 
any  number  of  functions  is  equivalent  to  the  maximization  of  the  summands 
or  individual  terms.  In  other  words,  we  can  write 


M 


max  [D  _.  ]  =  )  max  [D^] 

Ql'Q2' ‘ ’ ‘,QM  Qi 


(3.20) 


Let  R*  be  the  reaion  R, .  that  corresponds  to  the  optimal  quantizer  Q* , 
li  '  li  i 

and  let  p£_.  be  as  defined  in  (3.16)  with  R^  being  replaced  by  R*^. 
Suppose  that  H  is  the  true  hypothesis,  then  with  the  optimal  choice  of 
QVs  the  maximum  value  of  D  ,  D*,  is  given  by 


Di 


A 


max  [D.]  = 

Q1'V  ‘  ■  "Qm 


(1 


(3.21) 


The  expression  for  the  maximum  value  of  D^,  D*,  is  obtained  from  (3.18) 


and  is  given  by 


°0  = 


m  f  p* 

y  ko iog  +  u  ■  pio> 

in  no 


(3.22) 


Observe  that  the  optimization  of  D  does  not,  in  general,  lead  to  an 
optimal  Dq  and  vice  versa.  In  other  words,  we  can  optimize  either  or 
Dq  or  a  weighted  sum  of  them,  i.e.,  we  can  optimize  either  ASN^  or  ASNQ 
or  the  weighted  sum  given  by 

ASN  =  y  ASN1  +  ( 1— Y )  ASNq  ,  Y  £[0,1]  (3.23) 

Next,  we  show  that  an  optimal  local  quantizer  is  a  likelihood  ratio 
quantizer.  We  focus  our  attention  on  the  ith  quantizer  whose 
contribution  to  the  denominator  of  (3.18)  is  given  by 
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„  ,  pil  „  ,  ,  l_pil 

Dij  *  pij  109  *  {1-pii'  109 

J  J  ^iO  *  lO 


(3.24) 


When  is  the  true  hypothesis,  we  can  write  (3.24)  as  follows 


=*  p.,  log  — —  +  (1-p..)  log- - = 

il  *il  p.A  il  1-P, 


1_pil 


(3.25) 


We  treat  p^Q  as  a  constant,  and  differentiate  (3.25)  with  respect  to  p^ 
to  obtain 


’u  /.  pu  .  .  '-Pii  ,\  .  pu  .  '-fit 

-  -  {log  —  *  1-log  —  -  1)  -  log  —  ♦  log  — 


(3.26) 


For  p.,  >  p.„,  p.,  and  p.„  £  (0,1),  it  is  clear  that  p.,/p._  >  1  and 
il  iO  il  iO  il  iO 

(1-p. .) / (1-p. , )  >  1.  Therefore,  we  conclude  that 
iO  il 


dD  p  (1-p  ) 

5^7  *  109  [T-i-p1  .■  9  0  ■  £or  pii  ar-d  pio  e  (0-1)  l3-271 

il  iO  1 1 


Similarly,  when  is  the  true  hypothesis,  we  can  write  (3.24)  as  follows 


Di0  ■  pi0  109  J-  *  (1'Pio’  109 

xO  iO 


(2.28) 


Differentiating  (3.28)  with  respect  to  p^,  we  obtain 


^iO 

/  Pi0 

dpil 

=  {p'»  •  SIT 

(-  u\ 

.  ^i0 

'  Pil 

i-p  / 
*i0 

"  Pil 

(1-pil) 

(3.29) 


dOiO 

Equation  (3.29)  shows  clearly  that  — -  is  negative  for  p.„  >  p.„  and 

dp^  il  iO 

nontrivial  quantizer,  i.e.,  p^  and  p  £  (0,1).  The  numerator  of  the 
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rioht  hand  side  in  (3.18)  is  determined  by  the  values  of  error  proba¬ 


bility  a  and  p.  For  small  values  of  a  and  p,  it  is  clear  that  when 


H  (H  ;  is  true,  the  numerator  of  (3.18)  is  positive  (negative).  There¬ 
fore,  differentiating  (3.18)  with  respect  to  p^  and  observing  that  the 
numerator  which  is  denoted  by  N(a, P)  is  not  a  function  of  p^.,,  we  obtain 


d(ASN  ,) 
_ 1_ 

dp,  . 


-N«X,P)  . 

D2  . 

1D 


fiO 

dpil 


0, 


for  j =0 , 1 


(3.30) 


From  (3.30),  we  conclude  that  for  any  fixed  p^Q  and  nontrivial  quantizer 
(p^  >  p^0  and  p^,  p^0  £  (0,1)),  ASN_.  is  a  monotonically  decreasing 
function  of  .  Consequently,  the  optimum  quantizer  is  the  one  which 
maximizes  p  corresponding  to  a  fixed  p^  .  This  quantizer  is  obviously 
the  one  that  employs  a  Neyman-Pearson  criterion  (likelihood  ratio 
quantizer)  with  a  probability  of  false  alarm  equal  to  piQ  (0  <  piQ  <  1) . 


3.4.  Performance  Enhancement  by  Means  of  Multi-level  Quantization 

The  quantizers  considered  so  far  were  binary  valued.  Since  the 
quantizer  does  not  discriminate  between  the  observations  in  any  single 
region  of  its  two  regions,  the  quantization  leads  to  a  performance 
degradation  as  reflected  by  an  increase  of  both  ASN^  and  ASNQ .  However, 
this  performance  loss  can  be  reduced  by  further  partitioning  of  the  obser¬ 
vation  space,  i.e.,  by  allowing  more  quantization  levels.  In  this  section,  we 
show  that  ASNj,  j=0,l,  is  a  monotonically  decreasing  function  of  the 
number  of  quantization  levels  when  the  quantization  is  based  on  the  like¬ 
lihood  ratio  function. 

To  prove  our  claim,  it  suffices  to  show  that  given  any  region  R  of  the 
observation  space,  where  R  is  specified  in  terms  of  the  two  thresholds  t 
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and  (t  ^  <  t^)  such  that  for  any  point  x  f  R,  we  have  LR(t^)  <  LR(x)  < 

LR(t2)  as  shown  in  Fig.  3.3.  Then  as  a  result  of  partitioning  R  into 

two  regions,  the  denominator  in  the  expression  of  ASN_.  in  equation  (3.11) 

increases  when  is  true  (j=l)  and  decreases  when  is  true  (j=0) .  Let 

p  ,  j=0,l,  be  the  probability  of  an  observation  falling  in  R  when  H ^  is 

true  with  0  <  p  .  <  1.  Since  the  local  quantizers  were  designed  inde¬ 
nt 

pendently  as  shown  in  the  previous  section,  it  follows  that  the  region 

R  can  be  specified  at  any  of  the  local  observation  spaces.  Therefore, 

in  what  follows  we  simply  drop  the  index  i  of  the  local  quantizers  for 

convenience.  Let  the  region  R  be  partitioned  into  two  mutually  exclusive 
1  2 

subregions  R  and  R  such  that 

1  x  2 

prl  1  rl  *rl 

(3.31) 

Pr0  PrC  +  Pr0 

where  p1 .  is  the  probability  of  an  observation  falling  in  the  3ubregion 
D 

R1,  i  =  1,2,  when  H^,  j=0,l,  is  the  true  hypothesis.  We  assume  without 

2 

loss  of  generality  that  the  likelihood  ratio  at  any  point  in  R  is  higher 
or  at  least  equal  to  its  maximum  in  R'*' .  Therefore,  it  follows  that 


2  1 

Pr  1  Pr  1  Pr  1 

2  p  1 

Pr0  r°  Pr0 


(3.32) 


Upon  writing  the  expression  of  ASN^  as  in  (3.18)  after  deleting  the 
quantizer  index  i  and  looking  at  the  contribution  of  the  region  R  to  the 
denominator,  we  observe  that 


1 

D  .  =  £  +  p  .  log - 

3  rj  prQ 


(3.33) 


where  e  i.3  the  contribution  of  all  the  regions  in  the  observation  space 

1  2 

other  than  R.  Similarly,  the  contribution  of  the  subregions  R  and  R 
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LR(x) 


Fig.  3.3:  A  region  R  of  the  observation  space  such  that 
LR(tj)  <  LR(x)  <  LR(t?) 
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can  be  incorporated  in  to  obtain 


1  2 

1  Prl  2  Prl 

e  +  p  log  —  +  p  log  — 

Pr0  Pr0 


(3.34) 


Our  objective  is  to  3how  that  ASN_.,  j=0,l,  is  a  monotonically  decreasing 
function  of  the  number  of  quantization  levels.  Therefore,  comparing 
(3.33)  with  (3.34),  we  conclude  that  in  order  to  prove  the  claimed  mono¬ 
tonicity  it  suffices  to  show  that 

1  2 


1  ,  rl  2  rl  .  rl 

p„,  log  ~r  +  p „ i  log  T"  >  p„,  i°g  ~ — 


(3.35) 


Pr0  109 


1 

2 

Prl  2 

Prl 

,  Prl 

1“  +  PrO 

109  ~T  <  PrO 

9  ri 

PrO 

PrO 

rO 

(3.36) 


To  prove  the  inequality  (3.35)  which  corresponds  to  the  hypothesis 

2  2 

being  true,  we  assume  that  p  =  c,p  ,  and  p  =>  c  p  ,  where  c  >  c  as 
*  rrl  1  rl  rO  o  rO  1  o 

a  consequence  of  (3.32).  The  left  hand  side  of  (3.35)  can  easily  be 
written  as 


(1-c  )p  1  c  p  1 

l.h.s.  -  (i-c  >  pc1  109  Titrrr-  +  cipn  105  rr: 

o  rO  o  rO 


(3.37) 


Simplifying  (3.37),  we  obtain 


p  1  “*0  p  c 

L.H.s.  =  (i-c  >  Prl [log  f1  +  log  irr]  +  cipn[log  *  log  r\ 

^rO  o  rO  o 

=  Prliog  fZ  +  Prll  (1'C1)  109  T^T  +  C1  109  ~J 


(3.38) 
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The  first  term  on  the  R.H.S.  of  (3.38)  is  equal  to  the  R.H.S.  of  (3,35). 
Therefore,  the  proof  of  (3.35)  is  complete  once  the  second  term  on  the 
R.H.S.  of  (3.38)  is  shown  to  be  positive.  We  observe  that  p  >  0,  and 
use  the  inequality  log  x  <  x-1,  x  *  1,  to  obtain 


1~c] 

(1-c!)log 


and 


-  a-v  [it? -1] 

c,  c  c 

1  o  o 

log  —  =  -  c,  log  —  >  -  c,  ( —  -  1) 
c  1  c  1  c 

o  11 


-  (c  -c  )  (3.39) 
1  o 


c  -c  (3.40) 

1  o 


Combining  (3.39)  and  (3.40),  we  obtain 


i-cl  C;L 

(1-c . )  log  - -  +  c.  log  —  >  0 

1  l-C  1  c 

o  o 


(3.41) 


and,  therefore,  ASN^  decreases  monotonically  by  increasing  the  number  of 
quantization  levels.  Similarly,  when  HQ  is  the  true  hypothesis,  the 
inequality  (3.36)  can  be  simplified  yielding 


L.H.S.  =  p  .  log 

rO  p 


rl 


+  P 


rO 


rOl 


1-c, 


(1-c  )  log  - - 

o  1-c 


+  c  log 
o 

o  o 


fll 

c  . 


(3.42) 


The  first  term  on  the  R.H.S.  of  (3.42)  is  equal  to  the  R.H.S.  of  (3.36) 

1~c1  C1 

while  the  terms  (1-c  )  log  — —  and  c  log  —  can  be  simplified  as 

o  o 

follows 

1_ci  r  1-ci  1 

(1_Co)  l0g  r-T  <  L  73T—  -  XJ  =  -  (crco)  (3.43a) 


ci  rci  i 

c  log  —  <  c  —  -1  =c-c 
o  c  o  Lc  J  1  c 


(3.43b) 


From  (3.43),  it  follows  that 
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p  [  ( 1— c  )  log  -  +  c  log  — 1  <  0  (3.44) 

rO  L  o  ’  1-c  o  c  J 
o  o 

and,  therefore,  ASN^  decreases  monotonically  by  increasing  the  number  of 
quantization  levels  and  the  proof  is  complete. 

3.5.  The  Effect  of  Transmission  Errors 

In  Section  3.3,  we  have  considered  the  quantization  of  each  of  the 
local  observation  into  two  levels,  namely  1  and  0.  There  it  was 
assumed  that  the  quantized  local  observations  are  communicated  without 
errors  to  the  global  decision  maker  where  a  centralized  SPRT  is  per¬ 
formed.  However,  the  assumption  of  error  free  transmission  is  not 
realistic,  especially  at  high  transmission  rates.  Therefore,  we  devote 
this  section  to  a  study  of  the  effect  of  transmission  errors  on  the  error 
probabilities  a  and  (3.  For  the  sake  of  clarity,  we  consider  the  case  of 
two  identical  local  detectors  and  the  following  observation  model 


H0  =  yik  s  B<po>  1  =  x'2;  k  -  1 


(3.45) 


H1  :  yik  "  B(P1) 


Pl  =  l-p0  >  0.5 


where  B(«)  is  as  defined  in  (3.15).  The  choice  p^  =  l-pQ  is  intended  for 
facilitating  the  formulation  of  the  SPRT  at  the  central  detector  as  a 
random  walk  of  equal  steps  [28,29].  The  communication  channel  is  assumed 
to  be  a  binary  symmetric  channel  in  which  the  error  rate  is  pg  a3  shown 
in  Fig.,  3.4. 

The  quantized  observations  encounter  errors  as  they  pass  through  the 
noisy  channel.  At  the  central  detector,  the  observation  model  in  (3.45) 
is  still  valid  but  with  different  values  of  parameters,  namely  p^  and  p^ 
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are  replaced  by  and  p^c  respectively.  When  H. ,  j=0, 1,  is  the  true 
hypothesis,  we  can  solve  for  p.  as  follows 


p.  =  p.(l-p  )  +  (1-p.)  p  =  p .  (l-2p  )  +  p 

jc  *3  e  3  e  *3  e  e 


(3.46) 


Suppose  that  H  is  the  true  hypothesis,  we  obtain  from  (3.46)  that  p^c  = 

A* 

p, (1-p  )  +  do  .  Similarly,  when  H.  is  true,  we  obtain  pn  =p  (1-p  )  + 

1  e  1  e  0  uc  u  e 

We  note  that 


1  -  P0c  =  1-PC 


P0Pe  ' 


PlPe 


=  p  (1-p  )  +  p.p^  = 
l  e  0  e 


lc 


(3.47) 


From  (3.47),  we  observe  that  the  observation  model  at  the  central 
detector  is  symmetric  and  is  given  by 


H0  :  ®ik  S  8(P0c} 


i  =  1,2,  k  >  1 


(3.48) 


H  :  0)  2  B  (p  ) 

1  xk  lc 


plc  ■  ^Oc  >  °'5 


where  to.,  is  the  kth  observation  of  the  ith  local  detector  as  seen  at 
ik 

the  front  end  of  the  global  processor. 

The  central  processor  observes  the  incoming  quantized  observations 
and  performs  the  SPRT.  Let  p_.  be  the  probability  of  a  local  observation 
arriving  as  1  at  the  central  detector  when  H ^  is  the  true  hypothesis. 
Therefore,  the  observation  model  of  the  central  detector  is  as  follows: 

111  ,  with  prob.  p^ 

10  or  01,  with  prob.  2p,(l-p.)  ,  k  >  1  (3.49) 

3  3 

00,  with  prob.  (l-p_^)2 
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t  a 


where  y  ,  is  the  kth  observation  of  the  central  detector.  From  (3.49), 
ck 

it  follows  that  the  logarithm  of  the  likelihood  ratio  function  is  a  dis¬ 
crete  random  variable  Z  whose  probability  distribution  function  under 

C  K 

the  hypothesis  H  .  is  given  by 


-  -  A 

2  log  (p1/pQ),  with  prob.  p  =  P 

H.  :  Z  ,  =  <0  ,  with  prob.  2p.(l-p.)  =  S.  j=0,l;  k  >  1  (3.50) 

1  ck  ,  3  3.  £ 

-2  log  (px/p0)  ,  with  (l-p_,)2  =  Q_. 


In  (3.50),  the  positive  step  (2  log  (p^/Pq))  is  equal  to  the  magnitude  of 
the  negative  step  (-2  log  (Pq/P-^  ) •  Therefore,  the  central  SPRT  can  be 
easily  formulated  as  a  random  walk  on  a  finite  number  of  states  (Markov 
chain)  as  shown  in  Fig.  3.5.  The  states  labeled  0  and  N  are  absorbing 
states  with  decisions  in  favor  of  and  respectively.  A3  long  as  the 
process  is  in  one  of  the  transient  states  labeled  1,2,...,N-1,  the 
central  test  must  be  continued  until  for  the  first  time  the  process 
reaches  one  of  the  absorbing  states.  For  the  Markov  chain  in  Fig.  3.5, 
it  is  well  known  [29]  that  if  the  process  starts  in  the  transient  state 
number  Z,  then  the  probability  of  absorption  in  the  state  0  is  given  by 


u . 
9 


(Q./P.)Z 
3 _ 3 

1  -  (Q./P.)N 
3  3 


and  the  average  sample  number  ASN .  is  given  by 


ASN  . 
3 


(N-Z) 


M(Q./P.)Z 
i  -  (Qj/Pj) 


(3.51) 


(3.52) 
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E'rom  (3.52),  it  follows  that  for  a  given  a  and  p,  we  must  have  2  and  N 
chosen  such  that  u^  >  1-a  and  u^  <  p.  To  this  end  let  us  distinguish 
between  the  following  two  cases: 

Case  1 :  In  this  case,  the  channel  is  assumed  to  be  error  free  while  in 
fact  it  is  not  so,  i.e.,  the  design  is  based  on  the  parameters  Pg=Pg  anci 
p^=p^  in  (3.45)  while  the  actual  values  of  the  parameters  are  pQ<l-2pe)  + 
p^  and  p^d-^p^)  +  p^  respectively.  Therefore,  the  result  will  be  higher 
error  probabilities  than  specified.  This  fact  can  be  shown  by  observing 


Pn^~2p)  +  P 


—  =  =  1°  .  [l  +  —  1/[i  +  — ^ - 1 

Plc  Pi(1_2pe)  +  pe  pi  L  d-2Pe)P0J  L  d"2pe)P1J 


(3.53) 


Since  p  <  p  ,  it  follows  that 


therefore,  we  have 


U-2pe.p0  <l-2pe)p1 


>  1  and, 


P0c  ^  P0 
Plc  PL 


(3.54) 


On  the  other  hand,  for  sufficiently  large  N  and  Z,  which  is  the  case 
when  the  a  and  p  are  small,  we  can  approximate  the  denominator  in  (3.51) 
by  1.  Consequently,  we  can  write 


.  Q  P  P 

t“»z  -  eV.  (^>2Z 

1  c  P1  P1  P1C 


(3.55) 


Similarly,  it  can  be  easily  shown  that 

a  =  1-u.  3 
c  0 


(__0C)  2  (N-Z) 
Plc 


(3.56) 


where  a ,  and  p^  are  the  actual  error  probabilities  at  t  e  central  level 
when  the  channel  i3  not  error  free  as  assumed.  Equations  (3.55)  and 


(3.56)  show  clearly  that  >  d  and  P^  >  p  and  hence  the  errors  over 
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the  channel  will  lead  to  an  increase  in  the  error  probabilities.  Simi¬ 
larly,  it  can  be  shown  that  ASN_.  as  given  by  (3.52)  is  dominated  by 

p.-  Q,  =  (p  -p  ) (l-2p  )  <  (p  -p  ) ,  which  will  lead  to  an  increase  in 
j  j  10  e  10 

ASN . . 

3 


Case  2:  In  this  case,  the  channel  is  assumed  to  be  completely  known, 

therefore,  the  central  detector  has  a  completely  specified  observation 

model  under  both  hypotheses  as  given  by  (3.49)  or  (3.50)  with  p.  =  p.  . 

3  jc 

The  exact  knowledge  of  the  observation  model  at  the  front  end  of  the 

central  processor  enables  us  to  redesign  the  central  detector  taking  the 

transmission  errors  into  account.  Consider  Fig.  3.5  once  again  and,  let 

Z  and  N  be  the  initial  and  final  states  instead  of  Z  and  N  respec- 
c  c 

tively.  Therefore,  we  can  write 


and 


a  > 


<p0c/plc> 


2 (N  -Z  ) 
c  c 


1  -  (p0c/plc  > 


A 

2N  “a 
c 


2Z 


P  * 


(p0c/pic) 


-  =  p 

2N  'a 


(3.57) 


(3.58) 


1  "  (P0C/Plc) 

where  the  inequalities  (3.57)  and  (3.58)  are  obtained  by  restricting  the 
actual  error  probabilities  a  and  (5  as  shown  on  the  right  hand  3ide  of 
(3.57)  and  (3.58)  to  be  less  than  or  equal  to  their  specified  values  a 
and  P  respectively.  The  fact  that  p^  /p  >  Pq/P^  implies  that  Z^, 
N^-Z^,  and  are  larger  than  their  corresponding  values  in  Case  1.  The 
average  sample  number  obviously  increases  with  increasing  Z  and  N  and, 
therefore,  ASN.,  j=0,l,  tends  to  increase  when  the  channel  is  noisy. 
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3.6.  Numerical  Results 


The  numerical  results  presented  in  this  section  are  obtained  assuming 
two  identical  local  detectors  as  described  in  Section  3.5.  The  observa¬ 
tion  model  is  as  described  by  (3.42)  with  £5^=0. 54  and  p^  =  1-f^  =  0.46  and 
the  communication  channel  is  as  depicted  in  Fig.  3.4.  The  process  of  per¬ 
forming  the  central  SPRT  is  formulated  as  a  random  walk  as  described  in 
Section  3.5.  The  error  probabilities  are  adjusted  by  suitable  variation 
of  the  initial  state  Z  or  the  number  of  states  N+l  or  both.  In  Table  3.1, 
we  present  some  values  of  Z  and  N  along  with  the  resulting  error 
probabilities  and  average  sample  numbers,  for  the  case  of  one  and  two 
local  sensc-3. 

Table  3.1.  The  error  probabilities  and  ASM' 3  for  an  error  free 
channel  when  one  and  two  local  sensors  are  used. 


ONE 

SENSOR 

TWO 

SENSORS 

a 

P 

Z 

N 

ASNX 

asn0 

Z 

N 

ASN1 

asn0 

8.15X10-3 

8 . 15xl0“3 

30 

60 

368.9 

368.9 

15 

30 

184.45 

184.45 

4.29X10-3 

4.29X10*3 

34 

68 

421.3 

421.3 

17 

34 

210.68 

210.68 

1.64X10-3 

1.64X10-3 

40 

80 

498.4 

498.4 

20 

40 

249.18 

249.18 

2.68x10“ 6 

2.68X10"6 

80 

160 

1000 

1000 

40 

80 

500 

500 

1.15X10"8 

1.03X10-6 

86 

200 

1425 

1075 

43 

100 

712.5 

537.5 

8.86X10”10 

1.09X10-7 

100 

230 

1625 

1250 

50 

115 

812.5 

625 

5.4X10-7 

4.4X10"9 

120 

210 

1125 

1500 

60 

105 

562.5 

750 

7.45X10"7 

8.86X10-10 

130 

218 

1100 

1625 

65 

109 

550 

812.5 
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The  numerical  results  presented  in  Table  3.1  are  obtained  under  the 
assumption  that  p^  =  0 .  However,  the  effect  of  channel  errors  on  the  per¬ 
formance  of  the  central  detector  is  demonstrated  in  Table  3.2  for  the  two 
cases  described  in  Section  3.5.  When  p^  =  0,  the  choice  Z  =  N/2  =  43 
yields  a  =  3  =  1.03x10  ^  and  ASN^  =  ASN^  =  537.5.  As  pg  varies  while  Z 
and  N  are  fixed,  the  error  probabilities  a  and  p  as  well  as  the  average 
sample  numbers  ASN^  and  ASN^  varies  as  shown  under  Case  1.  On  the  other 
hand,  when  p^  varies  while  Z  and  N  are  adjusted  accordingly  as  described 
in  Case  2  of  Section  3.5,  the  error  probabilities  are  maintained  while 
ASN^  and  ASN^  are  higher  than  their  values  in  Case  1. 


Table  3.2.  The  effect  of  transmission  errors  on  the 
performance  of  the  central  SPRT. 


CASE  1 

CASE  2 

Pe 

a  =3 
c  c 

ASN1=ASNQ 

Z=N/2 

a  =p 
a  ra 

asn1=asnq 

0 

1.03X10”6 

537.5 

43 

1.03xl0~6 

537.5 

0,02 

1.79X10”6 

559.9 

45 

0.065X10-6 

585.9 

0.04 

3. llxio"6 

584.2 

47 

0.955X10"6 

638.6 

0.06 

5.4X10'6 

610.8 

49 

0.994X10"6 

696.0 

0.08 

9.39X10-6 

639.9 

52 

0.833X10'6 

773.8 

0.1 

1.63X10-5 

671.9 

54 

0.973X10-6 

843.7 

0,12 

2.84X10-5 

707.2 

57 

0.938X10'6 

937.5 

0.15 

6.49X10-3 

767.8 

62 

0.917X10“° 

1107.1 
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3 . 7.  Discussion 


In  this  chapter,  we  studied  a  centralized  SPRT  based  on  multisensor 
data,  and  showed  that  the  average  sample  numbers  are  monotonically 
decreasing  functions  of  the  number  of  local  sensors.  We  also  studied  the 
issue  of  quantization  of  the  local  observations  and  showed  that  an  optimal 
quantizer  is  obtained  by  using  a  set  of  independent  local  likelihood  ratio 
quantizers,  when  the  local  observations  are  independent  both  spatially  and 
temporally.  Furthermore,  we  showed  that  by  increasing  the  number  of 
quantization  levels,  the  average  sample  numbers  decrease  monotonically. 

The  formulation  of  the  central  SPRT  as  a  random  walk  on  a  finite  state 
machine  of  fixed  structure  (Markov  chain)  is  considered  for  the  case  of 
two  identical  local  sensors.  The  numerical  results  in  Table  3.1  show 
clearly  that  when  both  Z  and  N  are  even,  the  average  sample  numbers  are 
exactly  half  of  their  counterparts  in  the  case  of  single  local  sensor. 

When  the  transmission  errors  are  ignored,  the  result  is  shown  to  be  an 
increase  of  the  error  probabilities  and  average  sample  numbers  over  their 
specified  values  as  demonstrated  in  Case  1  of  Table  3.2.  In  addition,  we 
showed  that  when  the  transmission  errors  are  known,  they  can  be  incorpo¬ 
rated  in  the  design  of  the  central  SPRT  to  maintain  the  specified  error 
probabilities.  In  this  case,  the  only  effect  of  transmission  errors  is  to 
increase  the  average  sample  numbers  as  shown  in  Case  2  of  Table  3.2. 
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CHAPTER  FOUR 


A  SIMPLE  MULT I -SENSOR  SEQUENTIAL 
DETECTION  PROCEDURE 

4.1.  Introduction 

In  this  chapter,  a  simple  multi-sensor  decentralized  sequential  test 
is  proposed  and  analyzed.  In  Section  4.2,  we  describe  the  proposed  multi¬ 
sensor  sequential  test  and  show  that  it  satisfies  the  prespecified  error 
probabilities  a  and  p.  The  ASN  of  the  proposed  test  is  shown  to  be  a 
monotonically  decreasing  function  of  the  number  of  sensors  used.  More¬ 
over,  we  show  that  the  increase  in  the  error  probabilities  as  a  result  of 
truncation  of  the  multi-sensor  sequential  te3t  at  any  stage  nT  is  le33 
than  its  counterpart  in  the  single  sensor  case,  i.e.  a  single  sensor 
sequential  test  also  truncated  at  stage  n^.  In  Section  4.3,  we  employ  the 
memoryless  grouped  data  sequential  detection  (MLGDS)  test  procedure  [21] 
at  the  local  sensors  in  the  multi-sensor  decentralized  scheme.  The  ASN  is 
shown  to  be  a  monotonically  decreasing  function  of  the  number  of  sensors 
used.  This  result  3hows  the  feasibility  of  using  our  testing  scheme 
based  on  multiple  sensors  that  employ  the  MLGDS  test  procedure  at  the 
local  sensors  to  outperform  Wald's  SPRT  based  on  a  single  detector.  In 
other  words,  the  decision  time  of  the  multi-sensor  system  using  MLGDS 
local  detectors  decreases  when  more  sensors  are  added.  At  some  point,  the 
performance  loss  due  to  the  grouping  of  samples  and  memoryless  nature  of 
the  MLGDS  test  is  overcome  and  from  then  on  the  performance  in  terms  of 
average  decision  time  is  better  than  Wald's  SPRT.  Our  scheme  compensates 
for  the  performance  loss  due  to  the  grouping  of  samples  and  the  memoryless 
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nature  of  the  MLGDS  procedure  by  adding  more  detectors.  In  Section  4.4, 
we  propose  and  analyze  a  truncation  scheme  for  the  case  when  the  sensors 
employ  the  MLGDS  test  procedure.  Numerical  results  are  presented  in 
Section  4.5  for  illustration.  Finally,  the  results  of  this  chapter  are 
discussed  in  Section  4.6. 

4.2.  The  Proposed  Multi-Sensor  Sequential  Test 

Consider  the  problem  of  testing  a  simple  hypothesis  versus  a  simple 

alternative  H  .  The  multi-sensor  observation  system  is  as  shown  in  Fig. 

4.1  which  consists  of  M  local  detectors  and  a  supervisor  (global) 

detector.  Let  f(x.,  0)  denote  the  pdf  of  the  random  variable  x^,  i  = 

: ,2, . . .,M,  representing  the  observation  at  the  ith  local  detector. 

Let  Hq  be  the  hypothesis  that  0  =  0Q,  and  the  hypothesis  that  0  *  0^. 

Therefore,  the  pdf  of  x^  is  given  by  f(x^,  0_.)  when  H_.  is  true,  j=0,l. 

The  successive  observations  on  are  denoted  by  xil'x^2'  ' '  "  xin  *n  ~  ^ 

and  they  are  assumed  to  be  iid.  The  observations  are  assumed  to  be 

independent  from  one  local  detector  to  the  other,  i.e.,  x^m  is  independent 

of  x.,  (m  *  k)  ,  and  x„  (£  *  i) ,  The  local  sequential  detectors  SD.,  i  = 
i  vin  x 

1,2,...,  M,  are  designed  to  satisfy  the  prespecified  values  of  the  system 
error  probabilities  a  and  (i.  Each  local  detector  performs  an  SPRT  based 
on  its  own  sequence  of  observations  and  transmits  its  decision  to  the 
supervisor  detector  as  30on  as  a  decision  on  one  of  the  two  hypotheses  is 
reached.  Nothing  is  transmitted  if  the  decision  at  the  local  detector  is 
to  continue  taking  observations.  The  supervisor  detector  accepts  the 
first  incoming  local  decision  and  declares  it  to  be  the  global  decision. 
Also,  it  informs  all  the  local  detectors  to  terminate  their  tests  when  the 
global  decision  is  reached.  It  should  be  mentioned  that  when  two  or  more 
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Fig.  4.1:  Block  diagram  of  the  simple  multi-sensor  sequential 
detection  scheme.. 
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local  decisions  are  available  to  the  supervisor  detector  at  the  same  time, 
only  one  of  them  will  be  accepted  by  the  supervisor  detector  and  this 
choice  is  arbitrary. 

Next  we  show  that  the  overall  system  satisfies  the  required  error 


probabilities  a  and  (3.  Let  denote  the  probability  that  the  global 


decision  is  based  on  the  decision  of  the  ith  local  detector  when  H.  is 

D 

the  true  hypothesis.  Then,  we  have 


M 

y~  Y.  .  -  1  j  =  0,1  (4.  L) 

1=1  13 


Let  e. .  denote  the  probability  of  error  at  the  ith  local  detector  when  H. 
J-D  D 

is  the  true  hypothesis.  Expressing  the  global  error  probabilities  and 
S  in  terms  of  e. ,'s  and  y. .'s,  we  obtain 

g  ID  ID 


and 


M 

y  no  Lo 

f=i 


(4.2) 


(4.3) 


However,  the  design  of  the  local  detectors  to  satisfy  a  and  3  implies 

that  e^Q  =  a  and  e^  =  3  for  i  =  1,2,...,M.  Therefore,  it  follows  from 

(4.2),  (4.3)  and  (4.1)  that  =  a  and  3g  =  P  as  required. 

The  SPRT  at  SD^  is  based  on  its  local  observations  and  is  performed 

in  a  manner  identical  to  Wald  [1],  i.e.,  at  stage  n  (n  >  1) ,  the  likeli¬ 
hood  ratio  function  A.  is  computed: 


k=l 


V 

£lxik'  V 


(4.4) 


47 


This  likelihood  ratio  is  compared  with  two  thresholds  as  follows 


2:  t .  ,  stop  and  decide  H, 

1U  1 

<  stop  and  decide 

otherwise,  continue. 

(4.5) 

where  t.  and  t.„  are  the  test  thresholds  of  SD.,  which  can  be  approxi- 
iu  it  i 

mated  11]  by 


1  -  e. 


tiu  e 


iO 


il  _  1  -  P 
a 


and 


(4.6) 


'il 


i£  1  "  eio 


1  -  a 


Note  that  the  number  of  samples  necessary  for  the  termination  of  the 


sequential  test  at  SD^  is  a  discrete  random  variable  T^ .  We  denote  by 
f Ti  (n)  and  FT^(n)  the  Pdf  and  the  cumulative  distribution  function  (cdf) 
of  the  test  duration  T^,  where 


fT  <n)  =  Pr{Ti  =  n} 
i 


n  >  1 


(4.7) 


F  (n)  =  Pr{ T .  5  n} 

T  i 

i 

The  global  test  length  is  also  a  discrete  random  variable  denoted  by  T^ 
and  its  pdf  and  cdf  are  denoted  by  f  (n)  and  fcTg(n)  respectively. 

Now  we  show  that  the  global  ASN  decreases  monotonically  with  the 
increase  in  the  number  of  local  detectors.  We  observe  that  for  the  M 
detector  system  in  Fig.  4.1  the  event  that  the  global  test  terminates 


after  the  nth  stage  is  the  same  as  the  event  that  none  of  the  (SDJ's  has 
terminated  at  stage  n  or  before.  Therefore,  we  can  write 


48 


n  >  1 


(4.8) 


Pr{Tg  >  n}  =  Pr{Tx  >  n,  T£  >  n,...,  Tm  >  n}  , 

Using  the  independence  of  local  detectors,  we  may  write 


Pr{  T  >  n}  =  [1  -  F  (n)  ]  -  [1  -  F  (n)  ]  x  [1  -  F  (n)  ]  x...x  [1  -  F  (n)  ] 
g  g  1  2  M 

(4.9) 

Since  (1  -  (n) ]  <1  (n  finite),  i  =  1,2,...,M,  it  follows  from 

i 

(4.9)  that  Pr{ T  >  n}  decreases  as  the  number  of  local  detectors 

g 

increases.  The  global  ASN  is  given  by 


ASN  =  Pr{T  >  n}  (4.10) 

L _  g 

n=U  * 

Since  for  each  value  of  n,  Pr{T^  >  n}  decreases  with  increasing  M,  it 
follows  from  (4.10)  and  (4.9)  that  ASN  decreases  monotonically  with  M. 
Also, 


ASN,  =  )“  Pr{ T .  >  n}  i  »  1,2,  ...,M  (4.11) 

1  n^O  1 

Since  Pr{T  >  n}  <  Pr{T.  >  n}  for  M  >  1,  the  ASN  is  3trictly  less  than 

g  * 

any  ASN.,  i  =  1,2,...,M.  It  should  be  emphasized  that  the  above 
analysis  is  valid  under  both  hypotheses. 

From  equation  (4.9),  it  can  be  observed  that  the  probability  of 
termination  of  che  test  after  the  nth  stage  is  small  since  it  i3  the  product 
of  the  corresponding  probabilities  of  the  local  tests.  However,  it  is 
well  known  [1]  that  for  the  SPRT  there  does  not  exist  any  definite  upper 
bound  for  the  number  n  of  samples  to  be  observed  before  reaching  a 
decision.  Large  values  of  n  are  possible  with  a  nonzero  probability. 
Therefore,  it  is  necessary  from  an  implementation  point  of  view  to  set  a 
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definite  upper  bound  nT  on  n.  This  can  be  achieved  by  truncating  the 
sequential  procedure  at  n=nT,  i.e.,  by  specifying  a  new  decision  rule  for 
the  acceptance  or  rejection  of  at  the  n^th  stage  if  the  sequential  test 
did  not  terminate  prior  to  stage  nT- 

Next  we  show  that  the  increase  in  error  probabilities  due  to 
truncation  of  the  global  test  at  stage  n^  is  a  decreasing  function  of 
M.  Let  aT<nT>  and  PT(nT>  be  the  global  error  probabilities  at  the 
truncation  stage  n^.  Recall  that  all  local  detectors  are  identical, 
therefore,  if  we  consider  only  one  of  the  local  sequential  detectors,  say 
SD^,  i  =  1,2,...,M,  we  can  write 

0.  (n  )  =  Pr{A.  >  v /H  and  t.„  <  A.  <  t.  for  all  1  <  n  <  n  }  (4.12) 

T  T  m  0  it  m  iu  T 

T 

where  v  is  the  threshold  at  the  truncation  stage.  Similarly,  we  can 
write 

0_(n_)  =  Pr{ A.  <  v/H,  and  t,„  <  A.  <  t.  for  all  1  <.  n  <>  n_}  (4.13) 

'r  T  1  in  1  i2  in  iu  TJ 

T 

Let  a3<nT)  and  be  the  error  probabilities  satisfied  by  the  sequen¬ 

tial  portion  of  the  test.  The  optimality  of  Wald's  SPRT  [1]  implies  that 
no  other  sequential  procedure  can  satisfy  the  same  error  probabilities 
with  a  smaller  ASN.  However  for  the  truncated  te3t  the  ASN  is  obviously 
smaller.  Therefore,  at  least  one  of  the  error  probabilities  must  be 
higher  than  required.  In  our  analysis,  we  assume  aT(nT)  >  a3(nT)  anci 
^T^nT*  >  ^s*nT^  f°llowin9  the  convention  in  the  literature  [1,31].  For 

the  truncated  test,  the  overall  error  probabilities  a  (M)  and  0  (M)  for 

o  o 

the  M  detector  case  are  given  by: 

ao(M)  =  as(nT)  Pr{Tg  S  nT / Hq }  +  aT(nT)  ?r{Tg  >  nT/HQ}  (4.14) 
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(4.15) 


P0(M)  =  PS(nT’  Pr^Tg  2  W  +  PT'“T)  Pr(Tg  >  W 

and  for  the  single  detector  case,  say  for  the  ith  detector,  we  have 

tto(1)  =  as(nT)  Pr{Ti  -  W  +  W  Pr{Ti  >  W  <4,16) 

Pod)  =  Ps(nT>  Pr{Ti  *  VV  +  PT<nT>  Pr^Ti  >  nT/Hl}  (4'17) 

Simplifying  (4.14)  and  (4.16),  we  obtain 

VM)  =  ao(nT)  '  “VV  "  W1  Pr{Tg  -  nT/Hl}  (4-18> 

and 

“o(1)  =  VV  ~  CVV  ‘  as<nT)]  Pr{Tg  -  nT/Hl}  (4'19) 

respectively.  Taking  the  difference  a  (1)  -  a  (M) ,  we  arrive  at 

o  o 

a0(1)  •  VM)  =  (VV  ’  VV1  *  [Pr{Tg  5  nT/H0}  '  Pr(Ti  -  nT/H0}] 

As  shown  before,  the  second  term  on  the  right  hand  side  of  (4.20)  in¬ 
creases  with  increasing  M.  Therefore,  (Xq (M)  is  a  decreasing  function  of 
M.  In  the  same  manner,  we  can  show  that  (M)  is  also  a  monotonically 
decreasing  function  of  M. 

In  the  next  section,  we  focus  our  attention  on  the  analysis  of  the 
multi-sensor  decentralized  scheme  when  the  local  detectors  use  the  MLGDS 
procedure  of  Lee  and  Thomas  (21]  .  In  this  case,  simple  expressions  can 
be  obtained  for  the  test  length  probability  distribution  at  both  the 
local  and  global  levels.  Also  the  expression  for  a (n^) ,  jig(nT), 
aT(nT) •  and  PT(nT)  are  simpler  to  find  unlike  the  case  considered  above 
where  each  local  detector  performs  Wald's  SPRT. 

4.3.  The  Multi-Sensor  Decentralized  Scheme  using  the  MLGDS  Procedure 
Consider  the  problem  of  testing  a  simple  hypothesis  versus  a 
simple  location  alternative  H  .  The  observation  model  is  as  defined  in 
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Section  4.2  except  for  the  fact  that  f{x.,  9.)  is  replaced  by  f(x.-0.). 

ill! 

In  other  words,  the  observation  model  is  given  by 
Hq  :  x.  ~  f (x  -  90) 


,  i  £  1 


(4.21) 


H.  :  x.  -  f (x  -  9. )  . 

1  i  1 

where  0,  >  9„  and  x.  ~  f(x-0.)  means  that  the  ith  observation  x.  of  any 

1  0  i  ]  i 

local  detector  has  a  pdf  f (x-0  )  when  H^,j  =  0,1,  is  the  true  hypothesis. 

The  pdf  f(-)  is  assumed  to  be  continuous  and  symmetric,  i.e.,  f(x)=f(-x). 

The  multi-sensor  decentralized  scheme  using  identical  MLGDS  detectors  can 

be  described  as  follows.  At  the  nth  stage  (n  >  1),  and  at  each  local 

sensor,  using  the  current  group  of  Nq  samples,  a  sufficient  test  statistic 

T  (x„  )  is  formed,  where 
N  — N 
O  O 


XM  =  X  ,  X  ,  .  .  .  ,  X 

Nn  L  (n-l)NQ+l  (n-2) N  +1  nNnJ 


(4.22) 


Since  the  successive  observations  are  assumed  to  be  iid,  it  follows  that 
the  successive  test  statistics  are  iid.  Once  the  test  statistic  is 
formed,  the  following  test  is  performed  at  each  local  detector 


>  A  ,  stop  and  decide 
<  B  ,  stop  and  decide 

TN  (~N  *  [otherwise,  discard  all  previous  stages 
o  o 

'and  proceed  to  the  next  stage  • 


(4.23) 


where  A  and  B  are  the  test  thresholds  with  A  >  B.  The  values  of  A, B,  and 
Nq  are  predetermined  such  that  the  prespecified  values  of  a  and  [3  are 
satisfied  and  the  global  ASN  of  the  test  is  minimized. 


Let  P0  =  Pr{TN  (xN  )  £  A/Hq} ,  q0  =  Pr{TN  (x^  £  B/HQ},  and  rQ 
o  o  o  o 
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1 


-  (p0+qQ) .  Similarly  let  px  =  Pr{ (xN  )  >  A/H^^  =  Pr{  (xN>  <  B/H^, 

0  o  o  o 

and  r  =  1  -  (p^+q^)  •  Since  the  local  detectors  as  well  a3  their  observa¬ 
tions  are  identical  at  all  stages,  it  follows  that  the  global  decision 
will  be  based  on  any  one  of  the  local  decisions  with  equal  probability, 
i.e. , 


Tlj  **21  '  **«,  '  1  '  °'1 


(4.24) 


Since  the  local  sequential  test  can  terminate  at  any  stage,  the  error 
probabilities  can  be  expressed  as 


a  =  po  +  ropo  +  ropo  +  • • 


^0 

1  -  r. 


(4.25) 


P  =  qi  +  riqi  +  rlqi 


nl 

1  -  r. 


(4.26) 


Equations  (4.25)  and  (4.26)  give  the  necessary  conditions  imposed  on  A 
and  B  for  a  particular  choice  of  the  package  size  to  satisfy  a  and  3. 

We  observe  that  the  event  of  test  termination  is  independent  from  stage 
to  stage,  and  from  one  detector  to  another.  Therefore,  the  global  test 
length  is  geometrically  distributed  random  variable.  When  H  is  true,  its 
pdf  is  given 


f_  /u  (t  /H . )  =  (1  -  rM)(rM)n_1  •  S(t  -nN  )  ,  n  >  1  (4.27) 

T  /H  g  3  33  go 


M 

where  S(-)  is  the  Kronecker  delta  function  and  (r,)  is  the  probability 
that  all  of  the  local  te3ts  fail  to  decide  on  a  hypothesis  and  must  con¬ 
tinue  at  any  single  stage.  From  (4.27),  we  can  express  the  ASN  under  the 


hypothesis  H  to  be 
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(4.28) 


ASN.  =  E [T  /H,] 
3  g  J 


N 

o 


As  was  pointed  out  in  [21],  we  must  select  the  package  size  Nq  and  the 
test  thresholds  A,  and  B  before  the  test  can  be  executed.  The  thresholds 
A  and  B  must  obviously  satisfy  (4.25)  and  (4.26),  and,  in  general,  mini¬ 
mize  a  weighted  sum  of  ASN^  and  ASN^  as  given  by 


ASN  =  -n  ASNq  +  (1-n)  ASNX  ,  T|  £  [0,1]  (4.29) 


An  analytical  solution  of  this  optimization  problem  does  not  appear  to  be 
feasible  and,  therefore,  a  numerical  solution  is  necessary.  In  the 
following  we  propose  an  algorithm  for  the  determination  of  the  optimal 
package  size  Nq  and  the  test  thresholds. 

Step  1:  Pick  a  package  of  size  N  =  n  etl,  N _ ]  (where  N _  is  the 

- -  O  O  FSS  FSS 

number  of  observations  such  that  r,  =  0),  and  choose  A(n  )  with 

J  o 


and 


PA(n  ) 
0  o 


A 

R(n  ) 
o 


£I  /H0>  d  T„  <  “ 

no  o 

o 


(4.30) 


P,  (n  ) 
1  o 


A 

A  (n  ) 


f  (x  /H)  d  T  <  1-p 
T  -n  1  n  r 

no  o 

o 


(4.31) 


Step  2:  For  (n^)  in  (4.30)  there  exists  a  single  threshold  B^n^  such 
that  a  is  satisfied.  Let 
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w 


Step  3 
a) 


b) 

c) 

Step  4 


W 

£T  <*n  'V  d  Tn  ' 
no  o 

-oo  O 


it  follows  that  a  is  given  by 


a  = 


vv 


P0(no)  +  q0(no) 


(4.32) 


Similarly,  let  (n^)  = 


B.  (n  ) 
r  l  o 

f_  (x  /H  )  d  T 
T  — n  1  n 

no  o 

-oo  o 


and  obtain  (n  )  such  that  P  as  given  by 


3  = 


ql(no> 


q,  (n  )  +  P  (n  ) 
1  o  1  o 


(4.33) 


xs  satisfied. 


We  have  one  of  the  following  three  cases: 


B  (n  )  <  B  (n  ) .  This  case  represents  the  situation  where  A(n  ) 
0  o  1  o  o 

is  higher  than  the  optimal  A(n  ),  because  any  value  of  B(n  ), 

o  o 

B  (n  )  <  B  (n  )  <  B.  (n  ) ,  will  yield  smaller  values  of  a  and  B 
U  O  1  O  1  o 

than  specified.  Therefore,  choose  a  smaller  value  of  A(n  )  and 

o 


repeat . 


B  (n  )  >  B,  (n  )  .  In  this  case,  there  is  no  value  of  B(n  )  that 
u  o  1  o  o 

will  satisfy  the  required  a  and  p.  Therefore,  choose  a  higher 


value  of  &(nQ)  and  repeat. 


Bq  (n^)  =  B  (n  )  .  This  case  represents  the  solution  which  we  are 

seeking.  Therefore,  let  A(n  )=A(n  )  and  B,  (n  )  =  B(n  )-£L(n  ). 

o  o  1  o  o  0  o 

Once  the  thresholds  for  the  selected  n  are  determined,  we 

o 

repeat  the  process  to  determine  the  thresholds  for  (n  -1) .  Com- 

o 
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puce  the  corresponding  ASN  given  by  (4.29).  If  the  ASN  for  n 

o 

is  less  than  the  ASN  for  (n  -1) ,  we  increase  n  .  otherwise,  we 

o  o 

choose  a  smaller  value  of  n  . 

o 

Step  5:  The  algorithm  must  be  continued  until  for  the  first  time  at  n*, 

the  process  of  increasing  or  decreasing  n^  is  reversed.  The 

optimal  package  size  is  obviously  N*  =  n*. 

The  above  algorithm  is  based  on  the  fact  that  r^  is  monotonically 

decreasing  function  of  the  package  size  Nq.  This  fact  has  been  proven 

for  equal  error  probabilities  and  normally  distributed  observations  in 

[21].  To  prove  the  above  fact  in  the  general  case,  we  proceed  as  follows. 

Let  and  (1  -  r  (Nq)  )  be  any  package  size  and  the  corresponding 

probability  of  decision  on  one  of  the  hypotheses  when  H_.  is  true.  Let 

(N  +AN  )  and  [l-r.(N  +AN  )]  be  another  package  3ize  and  the  corresponding 
o  o  300 

probability  of  a  decision  on  one  of  the  hypotheses.  If  we  view  the 
increment  AN^  as  an  independent  package  and  apply  the  MLGDS  procedure  to 
the  individual  packages  of  size  Nq  and  ANq,  we  may  write 


r  .  (N  +  AN  )  =  r  .  (N  )  •  r  .  (AN  ) 

30  o  30  3° 


(4.34) 


since  r .  (AN  )  is  strictly  less  than  one  for  AN  >  1,  it  follows  that 
jo  o 

r .  (N  +  AN  j  <  r,(N  )  and,  therefore,  the  proof  is  complete, 
j  o  o  j  o 

Returning  to  the  multi-sensor  decentralized  scheme,  we  observe  from 

(4.28)  that  ASN ^  is  a  monotonically  increasing  function  of  the  number  of 

local  detectors  M.  In  the  remainder  of  this  section,  we  consider  the 

truncation  of  the  sequential  test  at  the  n^th  3tage.  A  package  of 

observations  of  the  same  3ize  N  is  taken  at  one  of  the  local  detectors 

o 


and  the  following  single  threshold  test  is  performed. 
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(4.35) 


' 

>  v  ,  decide  H 

tn  <*n  > 

o  o  [  <  v  ,  decide  H 

0 

where  v  is  the  test  threshold  at  the  (nT+l)th  stage.  Let  aT<N0)  anc* 

P  <N  )  be  the  error  probabilities  at  the  truncation  stage  with 

VV  =  Pr{TN  (xN  )  >  v/H()} 

o  o 

(4.36) 

Pt(N0)  =  Pr{TN  N  >  <  V/Hl} 
o  o 

The  overall  global  error  probabilities  aQM)  and  pQ  (M)  are  as  given  in 
(4.14)  and  (4.15)  respectively.  Upon  writing  (4.14)  and  (4.15) 
explicitly  and  observing  that  as<nT)  =  anc*  P3  (n^)  =  P?  we  obtain 

M  nT 

a  (M)  =  a  +  (aT  -  a)  <r  )  (4.37) 

and 

M 

Pq(M)  -  P  +  <PT  -  P)  (r^  (4.38) 

Since  r'1  <  1,  j  =  0,1,  it  follows  from  (4.37)  and  (4.38)  that  the  increase 
in  error  probabilities  (aQ(M)  -  a}  and  (Pq(M)  -  P}  is  a  monotonically 
decreasing  function  of  M,  the  number  of  sensors,  as  well  as  the  truncation 
stage  n^.  Consequently,  the  M  detector  system  is  less  sensitive  to 
truncation. 


The  Proposed  Truncation  Scheme 


We  have  shown  that  the  probability  of  the  global  test  length  T^ 
exceeding  a  specified  truncation  stage  n^  is  less  for  the  multi-sensor 
scheme  than  for  the  single  detector  case.  However,  T^  is  still  a  random 
variable  which  can  assume  excessively  large  values.  To  avoid  this 
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problem  which  will  limit  the  practical  use  of  sequential  procedures,  trun 
cation  is  necessary.  The  truncation  problem  is  not  simple  to  analyze 
[31-33]  except  for  the  MLGDS  procedure  in  [21] . 

The  proposed  truncation  scheme  for  the  system  under  consideration  is 
as  follows:  choose  A*  and  B*  (the  test  thresholds)  such  that 


and 


<  a 


(4.39) 


(4.40) 


where  P*,  r*,  q*,  and  r*  are  as  defined  earlier  when  the  thresholds  A*, 
and  B*  are  used  instead  of  A,  and  B  respectively.  The  local  and  global 
sequential  tests  are  performed  as  usual.  If  no  decision  is  reached  up  to 
the  nTth  stage,  only  one  local  detector  is  allowed  to  take  one  more  pack¬ 
age  of  observations  of  the  same  size  and  test  against  a  single  threshold 
v  as  described  in  (4.35).  Writing  the  expressions  for  the  overall  error 
probabilities,  we  obtain 


and 


M  T 

a  (M)  -  a*  +  (a(N  )  -  a*)  (r*  ) 
o  To  0 

M  ^T 

3  (M)  =  3*  +  <8  <N  )  -  P*) (r*  ) 
o  To  1 


(4.41) 


(4.42) 


where  a^tN^)  anc*  3T(N0)  are  33  defined  by  (4.36)  .  Since  a*  <  a  and  3*  < 

M  nT 

and  the  term  (r^  )  is  a  decreasing  function  of  n^;  there  exists  a  value 

of  n,  at  which  a  (M)  <  a  and  3  (M)  S  3-  The  smallest  value  of  n„  at 
T  o  o  T 

which  both  a  (M)  and  3  (M)  become  smaller  than  a  and  3  will  be  used  for 

o  ro 

truncation.  It  should  be  emphasized  that  for  different  values  of  a*  and 
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(3*,  the  solution  for  n^  is  different.  However,  the  determination  of  a* 
and  |3*  for  some  value  of  the  package  3ize  is  not  simple  because  of  the 
continuous  nature  of  a*  and  p*.  In  other  words,  all  values  a*  £(0,a)  and 
P*  £  ( 0 ,  p )  are  possible.  On  the  other  hand,  since  the  ASN  of  the 
untruncated  test  is  less  than  that  of  the  truncated  test  for  the  same  a 
and  p,  we  can  say  that  a  reasonable  choice  of  a*  and  P*  is  that  which 
leads  to  a  small  increase  in  the  ASN's  over  their  untruncated  counterparts 
while  maintaining  the  constraints  on  a  and  p.  The  average  sample  number 
of  the  truncated  test  is  given  by 


N 


M  T+1 
1  -  (r*  ) 

3 _ 


1  -  r 


M 


(4.43) 


when  the  hypothesis  H_.  is  true,  j=0,l. 


4,5.  Numerical  Results 

In  this  section,  we  present  some  numerical  results  for  the  multi¬ 
sensor  decentralized  scheme  using  MLGDS  procedure.  The  observation  model 
is  given  by 

HQ :  x  ~  N (0 ,  a2  ) 

:  xi  ~  N(9,  a2 ) 

where  N(0,  a2)  is  the  normal  density  function  with  mean  0,  and  variance 
a2 .  We  assume  that  9=0.2  and  a2  =  1 .  It  is  well  known  that  for  a  =  P, 
the  number  of  observations  necessary  for  the  optimal  FSS  linear  detector 
(Neyman-Pearson)  is  the  solution  of  the  equation 


Vs "  ‘f  <1>"1  '  a,>J 


59 


where  <!>{•)  is  the  cdf  of  the  normal  density  function  of  zero  mean  and  unit 
variance.  The  ASN  of  Wald's  SPRT  is  approximated  by  the  following 
equation  [1] 


(1-p)  log 


ASN. 


a 


P  log 


l-a 


82 /2c2 


In  Table  4.1,  the  optimal  package  size  N*  is  determined  numerically 
for  the  case  of  M  local  detectors,  M  =  1,2,3.  This  optimal  value  depends 
on  the  specified  error  probabilities  a  and  p  as  well  as  the  number  of 
local  detectors.  The  values  of  N  ,  ASNvj'  ASN(M),  and  N^  are  given  for 
some  values  of  a  and  (3.  Note  that  for  smaller  values  of  a  and  p,  it 
takes  more  sensors  to  compensate  for  the  loss  due  to  grouping  of  data. 


Table  4.1:  Optimal  package  sizes  N*  and  their  corresponding  ASN (M) 


a=P 

nfss 

ASNw 

N* 

ASN (1) 

N* 

2 

ASN  (2) 

N* 

ASN (3) 

IQ'2 

541 

225.2 

208 

312.0 

163 

219.1 

143 

181.8 

10-3 

954 

344.6 

359 

492.1 

289 

359.9 

246 

305.9 

io”4 

1383 

460.4 

511 

664.2 

406 

499.4 

364 

431.1 

10"5 

1818 

575.6 

659 

830.7 

534 

638.9 

485 

557.2 

10-6 

2259 

690.8 

808 

993.5 

663 

776.3 

600 

683.4 

The  increase  in  error  probabilities  as  a  result  of  truncation  is 
demonstrated  by  the  numerical  results  in  Table  42.  The  package  size  is 
set  at  N*  (given  in  Table  4.1)  irrespective  of  the  number  of  local 
detectors.  The  central  test  is  truncated  at  the  third  stage.  The 
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resulting  error  probabilities  a and  Pq,  and  the  ASN  of  the  truncated 
T 

test  (ASN  )  are  given  for  some  values  of  a  and  (3. 


Table  4.2:  Overall  error  probabilities  and  (3q,  and  the  ASN  for 


the  truncated  test 


<x=p 

ONE  DETECTOR 

a  =P  asnT 

O  0 

TWO  DETECTORS 

a  =P  asnt 

o  0 

THREE  DETECTORS 

a  =p  asnt 

o  o 

io'2 

-2 

1.24X10 

308.1 

1.009X10-2 

234 

1.003X10-2 

216 

i(f3 

1.55X10"3 

489.5 

1.011X10-3 

387.3 

1.0002X10-3 

366.2 

l 

o 

r— 4 

2.45X10-4 

662.3 

1.018X10-4 

539.7 

1.0002X10-4 

527.3 

io'5 

5.52X10"4 

829.2 

1.04X10-5 

688.4 

1.0004X10-5 

664.9 

icf6 

1S.56X10-6 

992.3 

1.095X10"6 

837.2 

1.0006X10-6 

813.3 

The  truncation  scheme  proposed  in  Section  4.4  is  applied  to  the 

distributed  system  of  two  and  three  local  detectors.  The  package  sizes 

are  the  opt.mal  for  the  untruncated  test  as  given  in  Table  3.1.  However, 

the  sequential  portion  of  the  test  is  designed  to  satisf'  and  P3  ((X3  < 

a,  <  P)  and  the  truncation  stage  M  is  obtained  by  requiring  the  overall 

error  probabilities  and  Pq  to  be  less  than  or  equal  to  a  and  P  respec- 

T 

tively.  In  Table  4.3,  we  present  values  of  M,  ASN,  ASN  ,  a  ,  P  ,  a  ,  and 

o  o  s 

Pg  as  obtained  for  different  values  of  a  and  p. 
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Table  4.3:  Overall  error  probabilities  and  ASN's  of  the  proposed  trun¬ 


cation  scheme . 


a=p 

M 

ASN 

TWO 

T 

ASN 

SENSORS 

a  =p 

3  3 

oR 

H 

T» 

o 

M 

ASN 

THREE 

T 

ASN 

SENSORS 

a  =p 

3  3 

a  -p 
o  ro 

-2 

-3 

-3 

-2 

-3 

10 

5 

219 . 1 

220.2 

9,79X10 

9.9X10 

5 

181.8 

182.8 

9.81X10 

9.86X10 

-3 

-4 

-4 

-4 

-5 

10 

5 

359.9 

361.5 

9.74X10 

9  88x10 

5 

305.9 

307.2 

9.72X10 

9.72X10 

-4 

-5 

-5 

-5 

-3 

10 

6 

499.4 

501.5 

9.65x10 

9.76X10 

5 

431.1 

432.6 

9.65X10 

9.94X10 

-5 

-6 

-6 

-6 

-6 

10 

6 

638.9 

640 . 9 

9.58x10 

9.81X10 

6 

557.2 

559.2 

9.62X10 

9.69x10 

-6 

-7 

-7 

-7 

-7 

10 

7 

776.3 

779 

9.56x10 

9.64X10 

6 

683.4 

685.7 

9.58X10 

9.86x10 

4,6.  Discussion 

In  this  chapter,  we  proposed  a  simple  multi-sensor  decentralized 
sequential  detection  procedure  and  derived  the  expressions  for  the  ASN's. 
The  case  when  each  local  detector  employs  the  MLGDS  procedure  of  Lee  and 
Thomas  [21]  is  considered  in  detail  and  a  truncation  scheme  is  analyzed. 

We  showed  that  the  performance  of  the  multi-sensor  scheme  is  a  mono- 
tonically  increasing  function  of  the  number  of  local  sensors  used.  The 
increase  in  error  probabilities  due  to  truncation  of  the  sequential 
procedure  at  any  stage  is  shown  to  be  a  decreasing  function  of  the  number 
of  local  detectors.  The  numerical  results  presented  show  clearly  the 
improved  performance  and  support  the  analysis.  In  the  simple  multi-sensor 
scheme,  each  local  detector  was  designed  independently  to  satisfy  the 
specified  error  probabilities,  i.e.,  no  coupling  between  local  thresholds 
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was  allowed.  It  is  expected  that  if  coupling  between  local  thresholds  is 
allowed  and  local  decisions  are  all  taken  into  account  in  the  process  of 
arriving  at  the  global  decision,  the  resulting  system  performance  will  be 
further  improved.  In  the  next  two  chapters,  we  will  study  distributed 
sequential  detection  systems  in  which  the  local  thresholds  are  coupled  and 
all  local  decisions  are  fused  at  the  global  decision  maker  to  arrive  at 
the  global  decision. 
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CHAPTER  FIVE 


A  DECENTRALIZED  SEQUENTIAL  TEST 
WITH  DATA  FUSION 

5.1.  Introduction 

In  Chapter  Four,  we  studied  a  simple  multi-sensor  sequential  detec¬ 
tion  procedure  and  showed  that  its  performance  improves  monotonically 
with  the  increase  in  the  number  of  sensors  used.  The  sequential  test  in 
Chapter  Four  is  carried  out  locally  and  only  local  decisions  are  communi¬ 
cated  to  the  global  decision  maker.  The  local  thresholds  are  not  coupled 
because  no  explicit  data  fusion  rule  is  employed.  All  the  thresholds  are 
designed  to  satisfy  the  global  error  probabilities  a  and  p. 

In  this  chapter,  we  allow  the  local  thresholds  to  be  coupled.  Each 
local  detector  performs  a  SF  IT  based  on  its  own  observations,  and 
communicates  its  decision  to  the  global  decision  maker  (fusion  center) 
whenever  it  decides  on  a  hypothesis.  Nothing  is  communicated  if  the  local 
decision  is  to  continue.  The  global  decision  maker  combines  the  incoming 
local  decisions  according  to  a  predetermined  fusion  rule  to  come  up  with 
the  global  (final)  decision.  In  Section  5.2,  we  describe  the  sequential 
detection  system,  and  define  the  observation  model.  The  global  error 
probabilities  are  functions  of  the  local  error  probabilities  and  thje 
fusion  rule.  An  analysis  for  different  possible  fusion  rules  is  presented 
in  Section  5.3,  with  emphasis  on  the  relationship  between  the  local  and 
global  error  probabilities.  Moreover,  the  expected  global  test  length 
is  derived  in  terms  of  the  local  error  probabilities  and  the  pdf's  of  the 
local  test  length.  The  case  of  three  local  detectors  is  considered 
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briefly  in  Section  5.4.  An  example  is  presented  in  Section  5.5  to 
illustrate  the  design  of  the  local  tests  for  prespecified  values  of  a  and 
(3,  and  for  different  fusion  rules.  In  Section  5.6,  we  discuss  the  re¬ 
sults  obtained  in  this  chapter. 


5.2.  A  Description  of  the  Distributed  System 

Consider  the  problem  of  testing  a  simple  hypothesis  HQ  versus  a  simple 

alternative  H  .  The  system  consists  of  two  sequential  detectors  and  a 

global  decision  maker  as  shown  in  Fig.  5.1.  Let  f(x,8  )  and  f (y,  0  ) 

x  y 

denote  the  pdf's  of  the  random  variables  x  and  y,  which  represent  the 

observations  of  the  first  and  second  sequential  detectors  respectively. 

Let  K  be  the  hypothesis  that  0  =  8  _,  8  =  0  and  H,  the  hypothesis 

0  x  xO  y  yO  1 

that  0  =  0  ,  and  0  =  0  , .  Thus,  the  pdf's  of  x  and  y  are  given  by  f (x, 

x  xl  y  yl  i  'a  i 

0  .)  and  f(y,  0  ,)  when  H.  is  true,  i=0,l.  The  successive  observations  on 
xi  yi  x 

x  are  denoted  by  x  .x_, ...,  x  =  x  (n  >  1)  and  they  are  assumed  to  be 
i  z  n  — n 

samples  of  iid  random  variables.  Similarly,  the  successive  observations 
on  y  are  denoted  by  y^  =  ^  (n  >  1)  and  they  are  assumed  to  be 

iid  random  variables.  For  any  two  positive  integers  n,m,  the  joint  pdf  of 
x^  and  v^  conditioned  on  the  hypothesis  is  given  by 


■V“i 


IT 

3=1 


f  (x, 


0  .) 

xi 


m 

TT 

3=1 


f(y3 


0  .) 
yi 


0,1 


(5.1) 


Based  on  its  own  observations,  each  local  detector  performs  an  SPRT.  The 

SPRT  at  the  first  detector  is  defined  as  follows:  Two  positive  constants 

A^  and  B1 (A  >  B^)  are  chosen  and  at  each  stage  n,  n  >  1,  of  testing,  the 

likelihood  ratio  function  A  is  computed: 

— n 
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Fig.  5.1:  A  distributed  system  consisting  of  two  local 
detectors  and  a  global  decision  maker. 
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A  similar  SPRT  is  implemented  at  the  second  detector  with  A and 
as  the  thresholds.  The  choice  of  the  thresholds  A^  and  B^,k  =  1,2, 
depends  on  the  values  of  the  local  error  probabilities  and  p^.  It  ha3 
been  shown  [1]  that  the  thresholds  are  approximated  in  terms  of  and  P^ 
as  follows:' 

\  *  11  -  V'° 

,k  -  1,2  (5.4) 

Bk 3  V(1  -  V 

A  more  convenient  local  test  can  be  obtained  by  talcing  the  logarithm 

of  both  sides  of  (5.3).  The  resulting  te3t3  at  the  two  detectors  are 

\ 

given  by 

>  log  A,  ,  decide  H„ 
n  k  1 

Y~  j  <  log  ,  decide  ,  k  =  1,2  (5.5) 

.  3=1  otherwise  ,  continue, 

where 

V!j  “  l0^  tf<xj '  9xl) /f (xj'  9x0)] 

and 

v2j  £  log  [f(y.,  eyi>/f(y.,  0yQ)] 


As  mentioned  above,  the  local  error  probabilities  and  P^  are 
functions  of  the  local  thresholds  A^  and  B^,  k  =  1,2.  By  adjusting  the 
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local  thresholds  we  can  vary  the  corresponding  local  error  probabilities. 
Since  the  observations  are  independent  from  one  local  detector  to  the 
other,  the  local  decisions  are  also  independent.  As  in  the  distributed 
fixed-sample-size  systems  using  Neyman-Pearson  criterion,  we  can  not 
obtain  the  global  error  probabilities  without  the  knowledge  of  the  fusion 
rule.  In  other  words,  the  global  error  probabilities  are  functions  of  the 
local  error  probabilities  and  the  fusion  rule.  Conversely,  given  the 
global  error  probabilities  a  and  P,  we  can  choose  the  fusion  rule  first, 
and  then  solve  for  a  and  P  ,  k  =  1,2,  or  equivalently  A  and  B  such  that 

K  K  K  K 

a  and  P  are  satisfied  and  the  global  average  sample  number  is  minimum. 

The  global  average  sample  number  ASN^  is  defined  as  the  average  number  of 
observations  necessary  for  reaching  a  global  decision.  Clearly,  an 
optimal  solution  will  require  the  choice  of  the  fusion  rule  among  all 
possible  fusion  rules  and  solving  for  the  local  thresholds  such  that  ASN^ 
is  minimized.  However,  for  a  distributed  system  consisting  of  two  local 
detectors  and  binary  local  decisions  (assuming  that  local  detectors  have 
reached  a  hypothesis  decision) ,  we  observe  that  the  number  of  possible 
binary  fusion  rules  is 


4  4 

24  =  y-  r4)  =  y  — il 

k  liJ  k 


For  sequential  hypothesis  testing,  the  fusion  rule  can  be  ternary  to 
allow  the  occurrence  of  the  event  continue.  In  this  case,  the  number  of 


possible  fusion  rules  i3  obviously  more  than  24  and  is  given  by 

f - 11 - =  3<  . 

For  an  optimal  solution,  all  possible  fusion  rules  must  be  enumerated  and 
the  one  yielding  the  lowest  ASN^  must  be  chosen.  In  the  following 
section,  we  study  only  three  different  possible  fusion  rules  and  evaluate 
their  performance. 
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5.3.  Analysis  of  Some  Possible  Fusion  Rules 


The  fusion  rules  of  interest  are  those  which  take  into  account  the  de¬ 
cisions  of  both  local  detectors,  and  satisfy  the  monotone  property  of  the 
likelihood  ratio  function  at  the  global  level.  Therefore,  we  consider 
only  three  fusion  rules  and  analyze  them. 


i)  AND  Fusion  Rule:  According  to  this  fusion  rule,  the  global  de¬ 
cision  is  H  (u  =H. )  if  and  only  if  both  local  decisions  are  H  (u  =H  and 
1  g  1  ill 

U  =H  )  ,  i‘e-' 


(  V  if  U1=H1  and  u2=hi 

Ug  =  f(ui'u2>  V  if  U1  =  H0  °r  U2=H0 

■  continue,  elsewhere  . 


From  (5.6),  it  follows  that  the  global  error  probabilities  are  given  by 

“  =  “l  ’  °2  (5.7) 

(1-p)  =  (1-P1)  •  (1-P2) 

It  is  clear  that  depending  on  the  first  incoming  local  decision,  the 
global  decision  will  be  reached  no  later  than  the  decision  time  of  the 
second  incoming  local  decision.  Since  an  SPRT  terminates  in  a  finite 
number  of  steps  (stages)  with  probability  one  [1],  it  follows  that  a 
global  decision  will  be  reached  in  a  finite  number  of  steps  with  proba¬ 
bility  one. 


From  (5.7),  we  observe  that  >  a  and  P^  <  P,  k  =  1,2.  Also,  since 
a^,  P^  <<  1  we  can  further  approximate  (5.4)  to  obtain 

Ak  "  <  a  ^  A  (5.8) 

\  s  <  m  b 
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From  (5.8),  we  observe  that  the  use  of  AND  fusion  rule  requires  lower 

local  thresholds  than  the  centralized  test.  This  will  lead  to  a  shorter 

local  average  decision  time  (ASN‘  nen  H  is  true  and  a  longer  local 

average  decision  time  when  is  the  true  hypothesis. 

The  two  main  functions  usually  used  in  evaluating  the  performance  of 

an  SPRT  are  the  power  function  and  the  ASN  function.  The  power  function 

PDg is  defined  as  the  probability  of  deciding  when  the  actual 

parameters  of  the  distributions  are  0  and  9  instead  of  9  ,  and  9 

x  y  xl  yx 

respectively.  From  (5.7),  it  follows  that  P^  (0  ,9  )  is  simply  the 

Dg  x  y 

product  of  the  local  power  functions  P  (0  )  and  P  (9  )  at  the  first  and 

Dx  x  Dy  y 

second  local  sequential  detectors  respectively.  Therefore,  we  can  write 


P^  (0  ,0  )  =  P^  (0  )  ■  Pn  (0  ) 
Dg  x  y  Dx  x  Dy  y 


(5.9) 


The  computation  of  P  (0  ,0  )  is  thus  achieved  by  computing  P  (0  )  and 

Dg  x  y  J  ^  *  Dx  x 

?Dy (0y)  at  the  local  detectors  exactly  a3  described  in  [1]. 

The  second  function  of  interest  is  the  expected  duration  of  the  global 


test  denoted  by  E[T_^/HJ  when  is  the  true  hypothesis.  Let  T^  denote 

the  test  length  of  the  kth,k=l,2  sequential  detector.  Let  A  ,  B1,  C  ,  D1, 

n  n  n  n 

and  Fjj"  be  the  events  that  { T  =n,  T  >n/H.},  {T  >n,  T  =n/H . } ,  { T  =n, 
T2<n/Hi},  {T1<n,  T^n/H^},  and  {T^-T^n/H^}  respectively.  Observe  that 
if  a  global  decision  is  reached  at  the  nth  stage,  then  one  of  the  above 
mutually  exclusive  events  must  have  occurred.  When  the  first  incoming 
decision  is  the  global  test  will  terminate  without  the  need  to  wait  for 
the  othe>-  local  detector  to  decide.  This  corresponds  to  the  events  A*  and 
B^.  On  the  other  hand,  if  the  first  incoming  local  decision  is  H^,  then 
the  global  decision  maker  must  wait  for  the  other  local  detector  to 
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decide.  This  corresponds  to  the  events  C1  and  D1.  When  both  decisions 

n  n 

arrive  at  the  same  time,  the  event  F1  occurs .  To  compute  the  proba- 

n 

bilities  of  the  events  A1,  B1,  C1,  D1  and  F1  the  exact  probability 

n  n  n  n  n 

distribution  functions  of  T^  and  T^  when  H^, i=0, 1,  is  true  are  required. 
The  distribution  function  of  the  test  length  T^  is  known  [1,28,29]  only 
when  the  increments  v  as  defined  in  equation  (5.5)  are  integral 
multiples  of  a  constant.  In  this  case,  the  exact  probability  distribution 
function  of  T^  can  be  derived.  Let  ak<n)  and  P^(n)  be  the  probabilities 
that  the  kth  sequential  detector  terminates  at  the  nth  stage  when  is 
true  accepting  H  and  HQ  respectively,  i.e. 


c£<n)  =  Pr{Tk=n,  uk=  H^H.} 


k=l, 2  and  i=0, 1 


n  >  1 


(5.10) 


- Pc(  V"'  VW 


n  n 

Let  af(n)  =  y~  aNj)  andfiNn)  =  y~  pNj).  From  (5.6)  and  (5.10),  it 

pi  pi 

follows  that  the  probability  of  reaching  a  global  decision  at  the  nth 
stage  when  H.  is  true  is  given  by 


Pr{T^=n/H . }  =  P^(n)(l  -  a^(n)  -  P^ln)]  +  p^n)  [1  -  a^(n)  -  P^(n)] 

+  [p^  (n)  +  a^(n)]  a*(n-l)  +  (n-1)  [p^  (n)  +  a^n)) 

+  [aj(n>  +  pj(n)]  [aj (n)  +  P^ (n>  3  (5.11) 

Simplifying  (5.11),  we  obtain 

Pr{Tg=n/H.}  =  P^(n)[l  -  j^Oi-l)]  +  p*  (n)  [1-fjJ  (n>  ] 

+  aj(n)  (n)  +  a^n)  (5.12) 
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To  simplify  the  analysis,  we  restrict  the  local  detectors  to  be 
identical.  In  this  case,  T  and  are  identically  distributed  under 
both  hypotheses.  Therefore,  equation  (5.12)  can  be  simplified  further  to 
obtain 

Pr{  T  =n/H  . }  =  P1  (n)  [2-P1  (n)  -piNn-l)]  +  a1  (n)  [a1  (n)  -  aNn-lIJ  (5.13) 

g  i 


where  a1  (n)  =>  a^(n),p1(n)  =  PjNn),  a1  (n)  -  o£(n),  and  pNn)  =  Pk(n), 
k=0,l.  Equation  (5.13)  can  be  used  to  numerically  compute  the  average 
global  test  duration.  However,  it  does  not  provide  enough  insight  to  the 
problem.  We  observe  that  if  is  the  true  hypothesis,  then  the  obtained 
local  decision  will  satisfy  the  lccal  error  probabilities.  Therefore,  we 
can  write 


Pr{T  =n/H . }  =  Pr{u  =H„/H.}  •  Pr(A  )  +  Pr{u  =H  /H.}  •  Pr(B 

g  i  1  0  l  n  2  0  l  n 


(5.14) 


+  Pr{ u  =H, /H . }  •  PrtC1)  +  Pr{u  =H,/H.}  •  PrfD1)  +  PrtF1) 

2  1  i  n  111  n  n 

From  the  definitions  of  the  minimum  and  maximum  of  two  random  variables 
[29],  we  observe  that 

Pr{min (T. , T_)  -  n/1.}  =  Pr(A1)  +  PrCB1)  +  PrfF1) 

12  i  n  n  n 

and 

Prfmax (T, , T. )  -  n/H.}  -  Pr(C1)  +  Pr(DX)  +  Pr(FX) 

X  2  X  n  n  n 

Therefore,  it  follows  from  (5 . 14) -  (5 . 16)  that  T^  is  given  by 


Tg 


min(T,T„)  with  probability  P  .  (H.) 
12  min  i 


max(T  ,T„)  with  probability  P  (H.) 
12  max  i 


(5.17) 


when  H.  is  the  true  hypothesis,  and  where  P  .  (H.)  *  Pr(u,  »  H„/H.) 

i  mm  i  1  0  i 
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Prdu  =  H./H.)  =  Pr(u.  =  H./H.)  and  P  (H.)  =  Pr(u.  =  H./H.)  =  Pr(u.  = 

2  0  i  t  u  i  max  i  ill  2 

H^/HJ  ^  Pr  (ue  =  ‘  Elation  (5.17)  shows  that  the  global  test 

duration  is  either  the  maximum  or  the  minimum  of  the  local  test  durations 
depending  on  whether  the  first  incoming  decision  is  or  respec¬ 
tively.  From  (5.17),  it  follows  that  the  expected  global  test  duration 
conditioned  on  the  hypothesis  H  ,i=0,l,  is  as  follows: 


E(T  /H .  ]  =  Pr (u  =H  /H . )  .  E [min (T  , T  ) /H . ]  + 

gi  tOi  12i 

Pr(u£=H1/Hi)  .  E[max(T1,T2)  /HJ  (5.18) 

Thus,  the  problem  reduces  to  finding  the  expected  values  of  the  minimum 
and  the  maximum  of  the  local  test  lengths.  Due  to  the  functional 
complexity  of  the  distribution  of  and  T2  under  both  hypotheses,  an 
analytical  solution  does  not  seem  feasible.  However,  the  numerical  compu 
tation  is  relatively  simple  and  it  can  be  performed  via  matrix  multiplica 
tions  (28, 29] . 


ii)  OR  Fusion  Rule:  In  this  case,  a  global  decision  in  favor  of 
is  declared  if  any  of  the  local  decisions  is  H^,  while  is  decided 

globally  if  and  only  if  both  local  decisions  are  .  i.e., 


u 

g 


A 


f <u1'u2> 


I 


Hx  ,  if  u1=H1  or  u2=H1 

Hq  ,  if  u1=Hq  and  u2=H 
continue,  elsewhere 


0 


(5.19) 


As  in  the  case  of  AND  fusion  rule,  the  OR  fusion  rule  will  reach  a  global 
decision  no  later  than  the  time  at  which  the  second  local  decision 
reaches  the  fusion  center.  Since  the  local  decisions  are  reached  in  a 
finite  number  of  steps  with  probability  one,  it  follows  that  a  global 
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decision  will  be  reached  after  a  finite  number  of  steps  with  probability 
one  also.  Therefore,  we  can  express  the  global  error  probabilities  in 
terms  of  the  local  error  probabilities  using  (5.19)  as  follows 

(l-a)  =  (l-a  )  •  (i-a2) 


P  -  • 


P2 


(5.20) 


From  (5.20),  it  is  clear  that  the  global  power  function  P  (0  ,8  )  as  de- 

Dg  x  y 

fined  earlier  can  be  expressed  as  the  product  of  the  local  power  func¬ 
tions  P  (0  )  and  P„  (8  ) .  In  particular,  we  have 
Dx  x  Dy  y 


{1  " 


P„  (0  ,0  )} 

Dg  x  y 


=  {1  - 


P_  (8  )} 
Dx  x 


{1  - 


Pn  (0  )} 

Dy  y 


(5.21) 


Therefore,  the  computation  of  the  power  functions  is  similar  to  the  AND 
fusion  rule  and  it  can  be  done  as  in  [1]  at  the  local  level.  Equation 
(5.20)  can  be  used  to  show  that  the  OR  fusion  rule  increases  the  local 
thresholds.  Thus,  the  average  local  test  length  increases  when  H1  is 
true,  and  decreases  when  HQ  is  true. 

The  exact  probability  distribution  function  of  the  global  test  length 
T  can  be  derived  provided  that  the  exact  distributions  of  the  local 

g 

test3  are  known.  In  a  manner  similar  to  the  AND  fusion  rule,  we  can  show 
that 

Pr{ T  =n/H . }  =  c£(n)  [1  -  aNn-l))  +  0^  (n)  (l-a^(n)] 

g  i  1  1  2  1 

+  P^(n>  P2(n>  +  p2(n)Pi(n-1)  (5,22) 


If  we  assume  identical  local  detectors  and  use  the  fact  that  local 

decisions  must  satisfy  the  local  error  probabilities,  we  can  easily  show 

that  the  global  test  duration  is  given  by  (5.17).  However,  for  the  OR 

fusion  rule  P  .  (H.)  =  Pr(u,  =  H  /H . )  =  Pr(u.  =  H ,/H.)  =  Pr(u,  *  H ,/H.)  and 

min  i  111  21i  tli 
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P  (H.)  =  Pr  (u  =  H./H.)  =  Pr  (u_  =  H./H.)  =  Pr(u..  =  H„/H.). 
max  x  lOx  20i  t  0  x 

average  global  test  duration  is  given  by 


Hence,  the 


E [T  /H.]  =  Pr (u  =H  /K . )  .  E [max (T. , T. ) /H . ]  + 

gi  tux  12x 


Pr(u£=H1/Hi)  .  E[min(T  ,T  )/H  ] 


(5.23) 


which  is  similar  to  (5.18)  except  for  the  fact  that  HQ  is  replaced  by 
and  vice  versa. 


iii)  Three-Way  Fusion  Rule:  According  to  this  fusion  rule,  a 

global  decision  in  favor  of  is  declared  if  and  only  if  both  local 

decisions  are  H.,  i=0,l.  If  the  local  decisions  are  not  the  same,  the 
1 

global  decision  maker  can  either  use  a  randomized  decision  rule  to  stop 
the  test,  or  it  can  repeat  the  local  sequential  tests  without  keeping  the 
previous  test  results.  We  first  consider  the  randomization  option,  and 
in  this  case  the  three-way  fusion  rule  is  given  by 


f<Ul,u2) 


H^,  with  probability  1  if  u^u^H^ 

,  with  probability  y  if  u^  *  u^ 

Hg,  with  probability  ( 1— Y )  if  u^  *  u^ 
Hq,  with  probability  1  if  u^=u2=Ho 
continue,  elsewhere. 


(5.24) 


Once  again,  because  the  global  decision  according  to  (5.24)  is  reached  no 
later  than  the  time  at  which  the  second  incoming  local  decision  is 
obtained,  it  follows  that  global  decision  will  be  obtained  with  proba¬ 
bility  one  in  a  finite  number  of  steps.  The  global  error  probabilities 
are  expressed  in  terms  of  the  local  error  probabilities  as  follows: 
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(5.25) 


a  =  a1a2  +  ya^l-c^)  +  y(i-a  )a 
P  =  pxp2  +  (i-y>  d-p1)p2  +  (i-y)p1(i-p2) 


Simplifying  (5.25),  we  obtain 

a  =  d-2y)a1a2  +  y(a1+a2) 


(5.26) 


p  =  (2y-i)p1p2  -i  (l-y)  (Px+P2) 

Therefore,  for  a^=a2  and  y  =  0.5,  we  observe  that  each  local  detector 
must  satisfy  the  global  error  probabilities.  Because  the  global  test  can 
not  terminate  before  both  local  decisions  are  obtained,  it  follows  that 
the  global  average  test  duration  is  higher  than  that  of  a  single 
sequential  detector.  In  other  word3,  the  fusion  rule  in  (5.24)  will  de¬ 
grade  the  performance  instead  of  improving  it,  so  we  don't  consider  it 
any  further.  Turning  our  attention  to  the  second  option,  we  can  formu¬ 
late  the  fusion  rule  as  follows 


u 

g 


A 

at 


f <u1'u2) 


if  u^H^  and  u2=H 
Hq  if  and  u2»H 
continue,  elsewhere. 


1 

0 


(5.27) 


Let  a  trial  be  defined  a3  the  process  of  starting  the  local  sequential 
tests  until  the  two  local  decisions  are  reached.  Note  that  if  the  two 
local  decisions  disagree,  more  than  one  trial  will  be  required.  The 
successive  trials  have  identically  distributed  random  lengths  under  the 
hypothesis  H^,  i=0,l,  as  a  consequence  of  the  observations  being 
identical.  Each  trial  is  guaranteed  to  terminate  in  a  finite  number  of 
steps  w _th  probability  one.  However,  the  probability  is  one  that  a 
series  of  repeated  trials  will  terminate  within  a  finite  number  of  trials, 
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it  follows  that  the  sequential  process  will  terminate  in  a  finite  number 


of  steps  with  probability  one. 

The  global  error  probabilities  can  be  obtained  by  summing  their 
values  at  all  trials.  To  this  end,  let  p^  be  the  probability  of  a  suc¬ 
cessful  trial  and  =  1-p  the  probability  of  an  unsuccessful  trial  when 
H.  is  true.  We  obtain 

i 


p0  =  <*^2  +  <1-0^5  <1~a2) 

Pl  =  +  d-P2) 


The  probability  of  error  of  the  first  kind  a  is  therefore  given  by 


°  =  “ia2  +  Wo  +  Wo  +>'-  ’ 


“ia2 


(5.29) 


Similarly,  the  probability  of  error  of  the  second  kind  P  is  given  by 


P  =  M2  +  PiP2%  +  ei02<+---  = 


PA 


(5.30) 


The  global  power  function  P„  (0  ,8  )  can  be  derived  in  the  same  way  used 

Dg  x  y 

in  deriving  (5.29)  and  (5.30).  Recalling  the  definition  of  P  (0  ,0  ) 

Dg  x  y 

and  utilizing  the  assumed  independence  of  local  observations,  we  can 

express  P^  (0,0)  in  terms  of  P^  (0  )  and  P  (0  )  as  follows 
Dg  x  y  Dx  x  Dy  y 


pn  <0  > 

Dx  x 

PDg(0x'V  =  P<0  )  •  P„.  (0  >  +  (1  -  P„.  (0.)][1  -  P_  (0..)] 


Pn  (9  ) 

py  v 


(5.31) 


Dx  x 


Dy  y 


Dx  x 


Dy  y 


where  the  denominator  of  (5.31)  is  simply  the  probability  of  a  successful 

trial  when  0^  and  8^  are  the  true  values  of  the  local  parameters.  From 

(5.31),  it  is  evident  that  the  computation  of  P  (0  ,0  )  i3  equivalent  to 

Dg  x  y 
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the  computation  of  both  P„  (8  )  and  P„  (9  )  a3  described  in  detail  by 

Dx  x  Dy  y 

Wald  in  [1] . 

The  expected  duration  of  the  global  test  when  is  true  is  the  next 
function  to  be  derived.  Let  the  random  variable  representing  the  number 


of  observations  in  the  mth  trial  when  H^  is  the  true  hypothesis  be 

denoted  bv  D.  ,i=0,l;  m£l.  Recall  that  the  random  variables  D.  (ra>l) 
lm  im 

are  iid  with  an  expected  duration  E[D  ]  given  by 


E[DJ  =  E[max(T  ,T2)  /HJ 


(5.32) 

where  (5.32)  is  derived  by  observing  that  for  a  trial  to  reach  an  end, 
both  local  decisions  must  be  reached.  The  conditional  expected  global 
test  duration  can  be  written  as  follows: 


E (T  /H.,m)  =  m  E[D. )  (5.33) 

g  i  i 

However,  E  [T  /H.,mj  occurs  with  probability  qfm  ^p..  The  expected 
g  i  ii. 

global  test  duration  can  be  obtained  by  taking  the  expected  value  of 
(5.33),  and  the  result  is 

«  E[D  ] 

E  [T  /H.]  =  )  m  •  E  [D .  1  q.  p.  =  - —  (5.34) 

9  1  1  1-  l  P. 

m=l  i 

For  sufficiently  small  error  probabilities,  we  observe  from  (5.28), 
(5.29),  and  (5.30)  that  for  a  =a  and  |J  =*(J  =|Jg,  we  have  pi  ~ 
a£  >  a,  and  >  (3.  Hence,  unlike  the  two  previous  fusion  rules,  the 
effect  of  this  fusion  rule  is  to  decrease  the  upper  thresholds  and  to 
increase  one  iower  thresholds  which  in  turn  decreases  the  average  local 
test  length  under  both  hypotheses  to  a  value  smaller  than  that  of  a 
single  sequential  detector. 
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5.4.  The  Three  Local  Detectors  Case 


In  this  section,  we  consider  extending  the  distributed  system  in 

Fig.  5.1  to  include  three  local  sequential  detectors  as  shown  in 

Fig.  5.2.  The  successive  observations  at  the  third  sequential  detector 

are  denoted  by  a),  ,(0_,  . .  .,(0  =0)  (nSl)  and  they  are  assumed  to  be  iid 
12  n  ~ n 

random  variables.  We  denote  by  f  (CO,  8  )  the  pdf  of  the  random  variable 
(0  representing  the  observations  at  the  third  local  detector  and  assume 
that  (0  is  independent  of  both  x  and  y.  Each  local  detector  carries  out 
an  SPRT  based  on  its  own  observations  as  described  in  Section  5.2.  As  is 
evident  from  the  results  of  distributed  FSS  systems  [11]  and  the  dis¬ 
cussion  at  the  end  of  Section  5.2,  the  number  of  fusion  rules  increases 
quite  rapidly  as  the  number  of  local  detectors  increase.  In  our  work,  we 
will  confine  our  analysis  to  a  generalization  of  the  three-way  fusion 
rule  which  is  symmetric  with  respect  to  the  hypotheses.  Therefore,  a 
global  decision  in  favor  of  the  hypothesis  tL  will  be  declared  if  and 
only  if  the  three  local  detectors  have  reached  their  decisions  and  they 
are  all  favoring  H^,i=0,l.  The  fusion  rule  car.  therefore  be  formulated 
as  follows : 


u 

g 


A 


f(ui'u2'U3) 


H1  'if  VU2=U3=H1 

H0  'if  U1=U2=U3=H0 
continue,  elsewhere 


(5.35) 


As  in  the  case  of  two  local  detectors,  we  define  a  trial  as  the 
process  of  starting  the  local  sequential  tests  until  the  three  local 
decisions  are  reached.  If  the  local  decisions  disagree,  we  begin  another 
trial  ignoring  the  results  of  previous  trials.  Let  0^  and  (5^  be  the 
error  probabilities  at  the  jth  local  detector, j=l, 2, 3 .  Let  p^  be  the 
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Fig.  5.2:  A  distributed  system  consisting  of  three  local 
detectors  and  a  global  decision  maker. 
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probability  of  achieving  a  successful  trial  and 


1-p.  when  H.  is 
1  i 


true,  to  obtain 


30  =  TT  +  TT  <!  -  v 

j=l  3  j=l  3 

px  =  TT  +  TT  -  p^> 

j=i  3  j=i 


(5.36) 


The  global  error  probabilities  can  be  derived  following  the  same  approach 
used  to  derive  (5.29)  and  (5.30),  the  results  are  given  by  the  following 
equation 


(5.37) 


“  =  (  TT  «j)  /  Pc 

j=l  3 


p  =  ( TT  pJ  /Pi 

j=i  3 


The  global  power  function  P„  (0  ,0  ,0  )  is  a  function  of  the  local  power 
*  Dg  x  y  0) 

functions  P„  (0  ) ,  P^  (0  )  and  P„  (0  )  as  given  by 
Dg  x  Dy  y'  DO)  (0  *  1 

(0  )  Pn  (0 

p  6  ,=  _ Dx  x  Dy  y  DO)  0) _ 

Dg  x'  y'  0)  P  (6  )P  (0  )P  (0  )  A  [1-  P„  (0  ))[1  -  P^  (0  )J[1-P„  (0  )] 
y  Dx  x  Dy  y  DO)  0)  Dx  x  Dy  y  DO)  0) 

(5.38) 

If  we  assume  identical  observations  at  all  local  detectors  and  equal 
local  performance,  i.e.,  a,=a.  and  P .=P», j=l, 2, 3.  It  follows  that  the 

j  4  j  4 

local  test  lengths  T  ,j=l,2,3,  are  identically  distributed  under  the 
hypothesis  H^,  i=0,l.  Moreover,  we  can  cpproximate 


a  -  or  s  3Va 

P  =  P^  or  P£  ^  3Vp 


(5.39) 


which  can  be  used  to  show  that  E(T./H.],  j=l,2,3;  i=0,l,  is  approxi- 

D  i 
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mately  equal  to  one-third  of  the  corresponding  value  in  the  centralized 


(one  detector)  case. 

The  average  global  test  duration  E[T  /H.)  when  H.  is  the  true  hypo- 

g  1  x 

thesis  can  be  derived  in  almost  the  same  way  used  for  the  case  of  the 

local  detectors.  However,  the  derivation  is  more  complicated  due  to  the 

fact  that  our  trials  are  of  two  different  types  each  requiring,  on  the 

average,  different  number  of  observations.  In  particular,  if  the  first 

two  incoming  local  decisions  are  different,  then  a  trial  can  be 

terminated  requiring  on  the  average  observations.  On  the  other  hand, 

if  the  first  two  incoming  local  decisions  are  the  same,  then  a  trial  must 

continue  until  the  third  local  decision  is  obtained.  Let  T„ ,  denote  the 

3i 

average  number  of  observations  in  the  trial  in  which  the  first  and  second 
incoming  local  decisions  are  identical.  Let  7 aid  7  denote  the  proba 
bility  that  when  H.  is  true,  our  trial  is  unsuccessful  and  requires 
and  T^^  average  number  of  observations  respectively.  Based  on  that,  we 
can  write 


:  [T  /H. ]  = 

g  1 


T3.,  with  prob.  1-Y2i-Y3i 
T2i  +  E[T^/HJ  ,  with  prob. 


i=0, 1  (5.40) 


T  .  +  E (T  /H. j 
3i  g  1 


with  prob.  7 


3i 


Simplifying  (5.40),  we  obtain 


E (T  /H. )  =  T_. (1  + 
g  1  3i 


Y3i 


1  *  Y3i  '  Y2i 


-]  +  t2.  [ 


Y2i 


-3  (5.41) 


1  -  Y3i  ~  Y2i 


To  simplify  the  numerical  computation,  we  can  derive  an  upper  bound  on 
E  (Tg/H^ ]  .  The  upper  bound  is  arrived  by  equating  T^  to  T^and  observ 
ing  that  T  >  T^  where  equality  holds  if  and  only  if  the  last  two 
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incoming  local  decisions  are  obtained  at  the  same  time.  By  recognizing 
T  ^  to  be  the  expected  value  of  the  maximum  of  the  three  local  test 
lengths,  we  arrive  at  the  following  inequality 


E (T  /H.]  < 

g  i 


E[max(T1,T2>T3)/H.] 

*1 


(5.42) 


The  inequality  (5.42)  is  tight  for  sufficiently  small 
bilities;,  in  this  case  both  y  and  y are  extremely 
age  global  test  length  is  dictated  by  the  probability 
trial  p.  z  1. 

i 


local  error  proba- 
small  and  the  aver- 
of  a  successful 


5.5.  Numerical  Example 

We  consider  the  case  of  two  identical  local  detectors.  The  observa¬ 
tions  are  assumed  to  be  iid  both  spatially  and  temporally.  Let  the 
observation  model  be  as  follows: 


V  xi  3  Yi  -  B(V 


H  :  X.  =  Y.  _  B(P, ) 
1  i  x  1 


P,  =  1-P,  *■  0.5 
1  0 


wnere  B(PJ  is  a  Bernoulli  random  variable  with  parameter  P^ .  Therefore, 
the  local  test  can  be  formulated  as  a  random  walk  on  the  states  numbered 
0,1,2,...,N.  The  states  0  and  N  are  absorbing  states  with  local 
decisions  and  respectively  while  the  remainder  of  the  states  are 
transient  states.  Starting  the  process  at  the  kth  state,  it  is  known 
(29]  that 
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and 


Pc‘“t  ■  W 


(d-p,)/p.)k  -  (d-p.)/p.lN 

_ 1  1 _ X  1 

1  -  ((1-P.)/P,)N 
1  1 


Pr(u£  =  f^/H  )  =  1  -  Pr(u£  =  f^/HJ 


In  the  example  we  choose  P  =  1-P  “  0.6667.  The  values  of  k  and  N  are 

chosen  to  yield  the  desired  values  of  the  local  error  probabilities  0^ 
and  .  Let  R  =  [00  ...  1  ...  0]  be  a  row  vector  of  size  N+l  whose 
elements  are  all  zero  except  at  the  kth  position.  Let  C  =  [10  ..  001]  be 
a  column  vector  with  elements  that  are  all  zero  except  for  1  at  the  0th 
and  the  Nth  positions.  Let  P^  be  the  transition  probability  matrix  [29] 
when  is  true,  i=0,l.  As  was  pointed  out  in  [29],  the  probability 
that  the  jth  local  test  will  terminate  before  or  at  the  nth  stage  is  given 
by 

Pr{T,  <  n/Hi}  =  R  P?C  ;  i-0,1;  j-1,2 

Using  the  concept  of  tail  probabilities  [29],  the  expected  value  of  T^ 

when  H.  is  true  can  be  derived  as  follows 
i 

OO 

E (T  . /H . ]  =  Pr{ T  .  >  n/H.} 

n=0 

Therefore,  we  can  write 

OO 

E (T  .  /H , ]  =  Y"  {1  -  R  P?C} 
j  l  L—  - 1— 

n=0 

which  gives  the  ASN  of  the  jth  local  test  when  is  the  true  hypothesis 
To  derive  the  ASN's  of  the  mind^T^)  and  the  maxd^rT  ),  we  observe  that 
T^  and  T0  are  iid,  and  hence 
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Pr{min<Tl/T2)  >  n/H^  =  {1  -  R  P^C}2 
Pr{max(T1,T2)  <  n/H^}  =  [R  P^C] 2 

From  those  two  equations  and  using  once  again  the  tail  probabilities,  we 
obtain 

oo 

E[min(TlfT2)/H  ]  =  Y  (I  “  B  i^C}2 

n=*0 

CO 

E[max(T  ,T  )/H  ]  =  Y_  {1  -  [R  p"c]2) 

n=0 

The  numerical  results  obtained  for  a  single  detector,  the  simple  multi¬ 
sensor  scheme  in  Chapter  Four,  and  the  various  fusion  rules  are  given 
below  for  some  values  of  a  and  (3. 


Single  Detector  The  Simple  AND  OR  Three-Way 


a  =  3  =  3.891  x  10'3 

Scheme 

E[T  /H  ] 

g  i 

23.81 

16.89 

16.44 

18.56 

15.98(67%) 

E<yv 

23.81 

16.89 

18.56 

16.44 

15.98(67%) 

a  =  2.43xl0_\  3=3 

.905X10-3 

E[T  /H  ] 

g  i 

35.76 

26.97 

23.75 

27.85 

24.18(67.6%) 

BIT  /HJ 

g  ° 

24 

16.94 

19.32 

16.59 

16.25(67.7%) 

a  =  3  =  1-526  X  10~5 

E[T  /HI 

g  i 

48 

37.5 

31.05 

40.04 

31(64.5%) 

EiyH0] 

48 

37.5 

40.04 

31.05 

31(64.5%) 

a  -  5.96X10"8,  3=1. 

526X10"5 

E[T  /HI 

g  i 

71.71 

58.95 

44.9 

61.28 

44.9(62.4%) 

E[T  /H_] 

g  o 

48 

37.53 

40.17 

31.00 

31.03(64.6%) 

a  =  3  =  2.33  x  10~iU 

E[T  /HI 

g  i 

96 

80.79 

58.45 

83.54 

58.45(61%) 

E1VV 

96 

80.79 

83.54 

58.45 

58.45(61%) 
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5.6.  Discussion 


A  distributed  system  consisting  of  two  local  sequential  detectors  and 
a  global  decision  maker  was  considered.  The  global  error  probabilities 
are  shown  to  be  functions  of  the  local  error  probabilities  as  well  as  of 
the  fusion  rule  used.  The  global  power  function  was  shown  to  be  a  simple 
function  of  the  local  power  functions  for  a  given  fusion  rule.  For  two 
identical  local  detectors,  the  average  global  test  length  was  derived  in 
terms  of  the  local  test  lengths.  The  case  of  three  local  detectors  was 
considered  briefly  and  an  upper  bound  was  derived  for  the  average  global 
test  length.  The  numerical  results  obtained  show  that  the  AND  fusion 
rule  performs  better  than  the  OR  fusion  rule  when  is  true,  and  vice 
versa  when  HQ  is  true.  On  the  other  hand,  the  Three-way  fusion  rule  com¬ 
bines  the  advantages  of  both  fusion  rules  and  performs  quite  well  under 
both  hypotheses,  with  better  performance  for  smaller  error  probabilities . 
Moreover,  the  Three-Way  fusion  rule  has  better  performance  than  the 
simple  sequential  scheme  presented  in  Chapter  Four  where  no  explicit 
fusion  rule  was  employed. 
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CHAPTER  SIX 


A  MODIFIED  DISTRIBUTED  SEQUENTIAL 
HYPOTHESIS  TESTING  PROCEDURE 
WITH  DATA  FUSION 


6.1.  Int  roduct ion 

In  binary  signal  detection  theory,  sequential  hypothesis  testing  pro¬ 
cedures  [1,2]  provide  a  significant  advantage  over  fixed-sample-size  (FSS) 
test  procedures.  For  the  same  error  probabilities  a  and  P,  the  sequential 
procedures,  on  the  average  require  substantially  less  number  of  observa¬ 
tions.  However,  the  sequential  procedures  are  usually  not  easy  to  imple¬ 
ment  and  require  a  random  number  of  observations  before  they  terminate. 

In  other  words,  the  sequential  procedures  do  not  give  any  definite  upper 
bound  on  the  number  of  observations  required  for  decision.  Such  a 
situation  will  limit  the  practical  use  of  sequential  procedures  and 
necessitate  truncation.  The  truncation  problem  [31-33]  is  not  easy  to 
analyze  except  for  the  simple  procedure  given  in  [21] . 

The  SPRT  was  generalized  in  [25,26]  for  the  detection  of  M  hypotheses 
with  different  means  of  the  normal  distribution.  As  was  pointed  out  in 
[25],  the  generalized  SPRT  is  not  easy  to  implement,  and  its  performance 
is  difficult  to  evaluate.  Furthermore,  it  should  be  mentioned  that  the 
sequential  test  in  [25]  is  not  guaranteed  to  be  optimal.  In  [27], 

Fleisher  et  al.  generalized  the  memory less  grouped-data  sequential  (MLGDS) 
test  procedure  of  Lee  and  Thomas  [21]  to  the  case  of  multiple  hypotheses 
with  different  means/var iances  of  the  normal  distribution.  The  sequential 
testinc  procedure  in  [2”]  exhibits  a  performance  superiority  over  the 
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optimal  F5S  test  while  maintaining  a  much  simpler  structure  and  analysis 
than  the  sequential  test  in  (25] . 

In  Chapters  Four  and  Five,  we  have  studied  some  decentralized  sequen¬ 
tial  hypothesis  testing  schemes.  The  numerical  results  obtained  show  that 
the  scheme  in  Chapter  Five  has  a  better  performance  than  that  in  Chapter 
Four.  However,  the  derivation  of  the  average  global  test  length  is  quite 
laborious  and  no  simple  truncation  scheme  is  available.  Motivated  by 
this,  we  consider  another  decentralized  sequential  testing  procedure. 

The  distributee  sequential  testing  procedure  is  a  generalization  of  the 
MLGDS  procedure  in  [21]  to  a  distributed  environment  as  described  and 
analyzed  in  Section  6.2,  for  the  binary  hypothesis  case.  In  Section  6.3, 
we  consider  the  case  of  M  hypotheses.  Section  6.4  contains  a 
description  and  analysis  of  the  truncation  scheme  applied  to  the  binary 
hypothesis  case.  Numerical  results  are  presented  in  Section  6.5,  and 
finally  this  chapter  is  concluded  in  Section  6.6  where  we  discuss  our 
results . 

6.2.  The  Modified  Distributed  Sequential  Test  for  Binary 

Hypothesis  Testing 

We  consider  a  distributed  system  consisting  of  two  local  detectors 
(LDs)  and  a  global  decision  maker  (GDM)  as  shown  in  Fig.  6,1.  The  LD3 
are  not  allowed  to  communicate  with  each  other.  Binary  hypothesis  test¬ 
ing  problem  is  considered  with  equally  probable  hypotheses.  At  any 

observation  time,  each  LD  takes  a  package  of  observations  of  size  N  ,  and 

o 

makes  a  decision  based  on  its  own  observations.  The  local  decision  is 
either  in  favor  of  one  of  the  two  underlying  hypotheses,  or  an  indecision 
(ignorance)  is  declared.  The  two  local  decisions  are  communxcated  to  the 
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Fig.  6.1:  A  distributed  system  consisting  of  two  local 
detectors  and  a  global  decision  maker. 
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GDM,  which  in  turn  combines  them  according  to  the  specified  fusion  rule 
to  come  up  with  the  global  decision.  In  this  chapter,  we  are  interested 
in  sequential  testing  procedures.  Therefore,  the  global  decision  is 
either  one  of  the  hypotheses  or  is  to  continue.  When  the  global  decision 
is  in  favor  of  one  of  the  hypotheses,  the  local  as  well  as  the  global 
tests  are  terminated.  However,  when  the  global  decision  is  to  continue, 
the  local  MLGDS  tests  are  repeated  discarding  the  results  of  all  previous 
stages . 

Let  x^,x2'**.  136  a  serene®  of  iid  random  variables  which  represent 
the  successive  observations  at  local  detector  #1.  Similarly,  let 


y  ,y  ,...  be  a  sequence  of  iid  observations  at  local  detector  #2.-  The 
observations  are  assumed  to  be  spatially  iid.  The  hypothesis  repre¬ 
sents  the  absence  of  a  signal  while  the  alternative  represents  the 
presence  of  the  signal.  The  observation  model  is  given  by 

H,  :  x.  =  9  +  n. 

1  i  i 


y,  =  0  +  n. 

i  i 


i  >  1 


(6.1) 


H  :  x.  -  n. 
Oil 


y.  “  n. 

i  i 


where  0  is  the  term  representing  the  signal  and  n^  is  the  additive  noise 
component.  We  assume  that  the  noise  is  normally  distributed  with  mean 
zero  and  variance  <72 ,  i.e.,  n^  ..  N(0,a2).  The  observations  are  taken  at 
the  LDs  in  groups  of  size  N  each.  Therefore,  the  test  statistics  at 


LD  #  1  and  LD  #  2  are  given  by 


(6.2) 


(6.3) 
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respectively.  Based  on  the  local  sufficient  test  statistics  T  (x  )  and 

N  —  N 
o  o 

tn  (Yn  the  hypothesis  testing  problem  is  modeled  by: 
o  o 


H1 : 

T„  (xK,  ) 

.  K(N  8 

,  N  02  ) 

1 

N  -N 

0  0 

O 

O 

T  (y  ) 
N  XN 

0  0 

_  n(n  e 
0 

,  NO2) 
0 

H_: 

Tk,  <Xv,  > 

_  n(0 

,  N  O2) 

0 

N  -N 

O  0 

O 

T  (y  ) 
N  XN 

0  0 

_  N  (0 

,  N  O2) 
O 

(6.4) 


For  equal  global  error  probabilities  a  and  (3  (where  a  is  the  probability 

of  deciding  when  is  true,  and  p  is  the  probability  of  deciding 

when  is  the  true  hypothesis) ,  it  follows  from  the  symmetry  of  the 

problem  that  the  local  thresholds  are  symmetric  around  the  point  N^G/2. 

Let  A_.  and  B_.,  j  =  l,2,  be  the  upper  and  lower  thresholds  at  the  jth  LD 

respectively.  Therefore,  we  can  write 

A.  =  N  0/2  +  C. (N  ) 

1  o  30 

j  -  1  ,  2  (6.5) 

B  .  =  N  e/2  -  C  .  (N  ) 

30  jo 

where  C  .  (N  )  is  a  constant  which  depends  on  the  package  size  N  for 
30  r  o 

specified  error  probabilities.  Following  the  convention  in  the  litera¬ 
ture  [7-13],  we  assume  identical  local  detectors.  Under  the  above  assump¬ 
tion,  equation  (6.5)  can  be  rewritten  as  follows 


A  =*  A  =A  =  N  0/2  +  C(N  ) 
1  2  o  o 


(6.6) 


B  =  B  =B„  =  N  G/2  -  C (N  ) 
1  2  o  o 
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Let  p.  =  Pr{T,  (x  )  2  A/H,}  and  q.  =  Pr{T  (x  )  <  B/H.}  for  i=0,l. 
i  N  — n  i  l  N  -n  i 

o  o 

It  follows  that 

pl  =  qQ  =  0(B /ViT  cr)  (6.7) 

p0  =  qx  =  1  -  <t  A/VNq  CT) 


where  <!>(•)  is  the  cdf  of  the  unit  normal  random  variable  N(0,1).  At  any 
stage  of  the  test,  LD  #1  computes  its  own  test  statistic  using  only  its 
current  group  of  observations  and  performs  the  following  test: 


f  >  A  ,  =  1 

tn  %,  >  j  5  6  ,  ^  =  0  (6.8) 

'  otherwise  ,  =  I 

where  u^  denotes  the  decision  of  LD  #1,  and  I  represents  indecision  in 
the  three  level  quantizer.  The  procedure  at  LD  #2  is  identical  to  that 
performed  at  LD  #1.  Therefore,  the  two  local  decisions  are  iid  discrete 
random  variables.  In  particular,  the  jth  local  decision  Uy  j“l,2,  under 
the  hypothesis  H^,  i  =  0,1,  is  distributed  as  follows: 

1  ,  with  prob.  p.  .  „ 

l  0=1,2 

L,  :  u,  =  0  ,  with  prob.  q,  (6,9) 

1 

J  ^  i=0,l 

I  ,  with  prob.  r_^ 


where  r.  =  1  -  (p.  +  q.).  The  GDM  observes  only  the  set  of  incoming 
i  i  i 

local  decisions  and  combines  them  according  to  some  fusion  rule.  The 
fusion  rule  g(u  ,  u^)  must  satisfy  the  monotone  property  of  the  global 
likelihood  ratio  (LR)  function  and  admit  three  different  courses  of 


action  (decide  H^,  H^,  or  continue) .  Motivated  by  the  results  of  the 
previous  chapter,  we  propose  to  use  the  following  symmetric  fusion  rule 
which  satisfies  the  above  requirements 
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(6.10) 


.  u^u^l  ,  stop  and  decide  H 

A  u  =u  =0  ,  stop  and  decide  Hq 

U  =  Cf  (u  ,  U  )  l  ^  * 

g  12  otherwise,  discard  all  previous 

local  decisions  and  continue. 

where  u  denotes  the  global  (final)  decision. 

9 

The  probability  of  error  of  the  first  kind,  a,  can  be  computed  by  sum¬ 
ming  the  probabilities  of  deciding  H  at  all  stages  when  Hg  is  the  true 
hypothesis.  The  successive  stages  are  identically  distributed  each  with 
the  probability  of  a  hypothesis  decision  (deciding  either  or  H  ) 
equal  to  (p2  *  q^)  and  a  probability  of  continue  equal  to  1  -  (Pg+  <Ig)  • 
Therefore,  we  can  write 


u  =H^  at  the  kth  stage/Hg  is  true} »Pr{ the  kth  stage  is  reached}. 
k=l  ^ 


Let  Pr(e)  be  defined  as  the  average  probability  of  error,  i.e., 

Pr(e)  =  a  Pr{Hg  is  true}  +  [3  PrfH^  is  true} 

Since  a=P,  it  follows  that  Pr(e)  =  a  =  (3,  and  the  probability  of  correct 
decision  is  1  -  Pr(e)  *  (q^/ (Pg  +  q2)] .  Inspecting  (6.11)  closely,  we 
observe  that  in  order  to  minimize  a  ■  Pr(e)  and  maximize  PQ  =  l-|3=l-Pr(e) 
simultaneously,  we  must  minimize  their  ratio,  i.e.,  (Pg/qg)2  =  (Pg/p^)2- 
Similarly,  we  can  minimize  P  and  maximize  (1-a)  =  l-Pr(e)  simultaneously 
by  minimizing  the  ratio  (q^/Og)2.  However,  the  quantity  (Pg/p^)2  or 
equivalently  (Pg/p^)  is  minimized  in  the  region  R^  of  the  local  observa¬ 
tion  space  such  that  the  local  LR  is  maximum.  Similarly  the  quantity 
(qi/q0)2  or  equivalently  (q1/qg)  i3  minimized  in  the  region  Rg  of  the 
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local  observation  space  where  the  local  LR  is  minimum.  Because  the  local 
decisions  are  ternary  by  definition,  it  follows  that  is  the  remainder 
of  the  observation  space  as  shown  in  Fig.  6.2.  Therefore,  we  conclude 
that  an  optimal  local  test  must  necessarily  be  a  likelihood  ratio  test 
that  partitions  the  observation  space  into  three  mutually  exclusive 
regions  R  ,  RJ#  and  such  that  LR (RQ )  <  LR (R^)  <  LR(R^) .  When  the 
observation  falls  in  R^,  i  =  0,  I,  1,  the  corresponding  local  decision  is 
u^  =  i;  £  =  1,2.  The  independence  assumption  implies  that  the  joint  LR 
function  is  the  product  of  the  local  likelihood  ratios  LR^  and  LR^  at  LD 
#1  and  LD  #2  respectively.  Moreover,  the  global  decision  maker  observes 
only  the  set  of  incoming  local  decisions.  Therefore,  we  conclude  that 
the  optimal  global  test  (fusion  rule)  is  a  LR  test  performed  on  the  set 
of  local  decisions  as  given  by  (6.10).  At  this  stage,  it  3hould  be 
emphasized  that  because  the  observations  are  normal,  the  tests  in  (6.5) 
and  (6.6)  are  likelihood  ratio  tests,  and  hence,  they  are  optimal. 

Returning  to  equation  (6.11),  we  observe  that  when  is  the  true 

hypothesis,  the  probability  that  the  global  test  will  terminate  at  the 

k-*  1 

kth  stage  (k  >  1)  is  given  by  (1  -  (p^  +  q^)  ]  (p^+  q^)  .  Therefore, 

the  average  sample  number  (ASN)  is  the  same  under  both  hypotheses  and  is 
given  by 


ASN  =  N 


f  k  ( 1  - 


<P 


K=1 


2>,k-l 
+  ^0>] 


(P0  + 


V  = 


(po 


(6.12) 


It  is  clear  from  (6.11)  that  p^  <  a,  or  equivalently  p^  <  'la  for  any 
value  of  a.  Moreover,  (6.11)  can  be  written  as  follows: 


pQ  =  'I a/(i-a)  qQ  =  Va/(i-a)  px 


(6.13) 
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R0(Uj=O)  R,(uJ=l)  R,(Uj»l) 

J  C  N^=C  ^9 
LR(R0)<  LR(Rj)  <LR(Rj) 

Fig.  6.2  :  Partitioning  of  the  j1*1  local  observation  space 
to  minimize  error  probabilities. 
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For  any  value  of  p  ,  the  package  size  Nq  is  [21]  given  by 


or  equivalently 


No  =  2  C*1*-1  <1-po)  +  <t>'1  P0)2  (6.14) 


No  =  <f)2  (°  1  Wa/(l-a)p0)  -  <t>  1  (pQ ) ) 2  (6.15) 


where  <X>  ( • )  is  the  inverse  of  <!>(•).  Substituting  for  from  (6.15)  in 


(6.12),  we  obtain 


no  =  a<^)2(<t>1  (Va/(i-a)p0)  -  C>  (pQ))2/p2  (6.16) 


The  package  size  is  a  monotonically  increasing  function  of  p^  and  there¬ 
fore,  for  each  value  of  N  there  exists  a  unique  value  of  p  that  will 

o  0 

satisfy  the  error  probabilities.  To  show  that,  we  differentiate  (6.15) 

with  resDect  to  p,,  to  obtain 
0 


-  =2  <|)2  (<t>  1  (Va/d-o)  pQ-  *1(P0)) 

(6.17) 

(V(i-a)/a  •  - -i  1  ■  = - - ) 

1  4>  [O  (V(i-o)/ap)]  4>  [<t»  (P0>lj 


where  ij)  is  the  unit  normal  density  function.  The  term  [<1>  V(l-a)/a  p^  - 
<t>  1  <Pq )  3  is  positive  because  V(l-a)/a  p^  >  p^  for  values  of  a  of  interest 
(a  <  0.5).  Rewriting  (6.17),  we  obtain 


=  {2  <^)2  (<1>  1  (Va/(l-0)  p  -<D1(p  ))}  /  {<(>[<&  1  (p Q)  ]} 


Ja/ (1-a) 


♦  (O'1  (P())  l 

4>  (O  (V  (l-a)  /a  p  )  ] 


(6.18) 
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The  first  term  on  the  right  hand  side  of  (6.18)  is  positive  because  the 
numerator  and  denominator  are  both  positive.  The  term  ij)  [<t>  ^  (p  )  ]  / 

0(<I>  1  V  (1-00  /a  Pq)]  is  recognized  as  the  inverse  of  the  LR  function  at 
the  upper  threshold  A.  The  LR  function  is  strictly  monotonically  increas¬ 
ing  function  of  its  argument.  Therefore,  it  follows  that 


pl  A  Pr{\  -N J  ~  A/Hl}  ♦[<jT1(V(l-o)/o  p  )] 

T - —  (6.19) 


Pr{T  (x  )  >  A/Hj 


N  -N 
o  o 


M*  <P0>  3 


From  (6.13),  we  have  P^Pq  =  V(l-a)/a  ,  and  thus 


Va/ (l-a) 


4>  1(P0>] 


♦  (^(l-ai/a  pQ)  ] 


-  l  >  o 


(6.20) 


dN 

We  conclude  from  (6.20)  and  (6.18)  that  is  strictly  positive  for  all 

Pq  <  a.  Consequently  Nq  is  a  monotonically  increasing  function  of  p  .  The 
ASN  as  given  in  (6.16)  is  also  a  function  of  p^  which  can  be  written 


simply  as  follows 


ON 

ASN  =  "  2~ 


(6.21) 


It  is  clear  that  an  optimal  choice  of  the  package  size  Nq  or  equivalently 
Pq  will  minimize  the  ASN.  Let  N*  and  p*  be  the  optimal  package  size  and 
its  corresponding  value  of  Pq .  Differentiating  (6.21)  with  respect  to  Pq 


and  setting  the  result  equal  to  zero,  we  obtain 


dN  2N 
o  o 


0 


(6.22) 


Solving  (6.22),  we  obtain  the  optimal  package  size  N*.  However,  an 
analytical  solution  for  N*  does  not  appear  to  be  feasible,  and  one  needs 
to  consider  numerical  search.  In  [21],  it  was  shown  numerically,  that  for 
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- 1 8  ”2 

10  <  a  <  10  ,  the  choice  N  -  N  /3  is  nearly  optimal,  where  N  is 

O  FSS  FSo 

the  number  of  observations  required  by  the  optimal  FSS  detector  to  satisfy 

the  same  error  rates.  In  our  case,  since  two  local  detectors  are  being 

used,  we  can  restrict  the  numerical  search  to  the  set  of  integers  - 

N  <  N  /3,  The  upper  limit  N  /3  represents  the  case  when  the 
o  FSS  FSS 

decentralized  system  is  equivalent  to  one  detector  (no  gain  in 

performance),  while  the  lower  limit  N  /6  represents  the  case  when  the 

FSS 

decentralized  system  is  equivalent  to  the  centralized  case  (transmission 
of  the  analog  observations) . 


6.3.  The  Modified  Sequential  Test  for  M-ary  Hypothesis  Testing 

Once  again  we  consider  the  distributed  system  shown  in  Fig.  6.1.  The 
observations  are  as  defined  in  the  previous  section  except  that  there  are 
M  equally  probable  hypotheses.  In  particular,  the  observation  model  is 
as  follows:  , 


H 


k  '' 


x .  =  k9  +  n . 

1  1  i2  1  ;  k  =  0,1,.. . ,M-1 

y .  =  k8  +  n . 

i  i 


(6.23) 


where  k9  is  the  constant  signal  component  under  the  hypothesis  and  n^ 
is  as  defined  earlier.  The  local  test  procedures  are  identical,  there¬ 
fore,  we  describe  only  the  test  procedure  at  LD  #1.  At  any  stage  of  the 
test,  a  package  of  observations  is  taken  and  a  sufficient  statistic 

T  )  is  computed.  Let  the  pdf  of  the  test  statistic  T  (x  )  be 

o  o  o  o 

denoted  by  f (t/H^)  when  is  the  true  hypothesis.  The  local  te3t  is 

obtained  by  generalizing  the  test  for  the  binary  hypotheses  case  as  given 

by  (6.8) .  It  is  known  [34]  that  for  the  general  case,  where  the 

M(M-l) 

hypotheses  are  not  ordered,  we  require  a  set  of  - - -  pairwise  likelihood 

ratio  tests.  However,  the  hypotheses  in  (6.23)  are  ordered  and,  there- 
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fore,  we  need  to  test  between  adjacent  hypotheses  only.  The  resulting 
test  can  be  summarized  as  follows:  accept  the  hypothesis  if  both  of 
the  following  two  likelihood  ratio  tests. 


flt/V  ] 

X 

u 

=  T 

-1 

accept 

Hk+: 

£<t/Hk)  i 

{  ^ 

X£ 

=  T 

accept 

Hk 

f  ^ 

t 

u 

=  I 

-1 

accept 

Hk 

l  s 

Zz 

=  X 

accept 

Vi 

lead  to  the  acceptance  of  H  .  The  choice  of  the  thresholds  in  (6.24)  is 
intended  to  maintain  the  symmetry  of  the  local  te3t,  i.e.,  the  local  test 
weighs  all  the  hypotheses  equally.  Let  and  be  the  probabilities  of 
deciding  and  respectively,  when  is  the  true  hypothesis. 

It  is  clear  from  (6.23)  and  (6.24)  that  a,  and  B,  are  the  same  for  all 
values  of  k  and  that  =  (5^  is  maintained.  For  sufficiently  small 
values  of  the  average  error  probability,  we  can  neglect  the  errors  that 
can  occur  between  nonadjacent  hypotheses.  In  this  case,  it  can  be  shown 
that  the  test  in  (6.24)  is  optimal  and  indeed  minimizes  the  local  average 
probability  of  error.  Observe  that  the  thresholds  in  (6.24)  are  dif¬ 
ferent  from  those  in  (27) .  The  test  in  (27)  uses  thresholds  that  are 
identical  to  those  of  the  sequential  te3t  in  [25,26]  and,  therefore,  they 
are  not  necessarily  optimal.  Simplifying  equation  (6.24),  we  arrive  at 
the  following  equivalent  tests 


o 


A 


N 

o 

I 

i=i 


X. 

1 


,  N  (2k+l) 0 
>  — - 

N  (2&+l)0 

<  -2 - 

"  2 


+  c(N  ) 
o 

-  5(N  ) 
o 


accept  H,  , 
k+1 

accept 


(x  ) 
-N 

o 


A 


N 

o 

Y 

£=1 


x . 
i 


N  ( 2  k— 1 ) 0 
>  — - 

'  N  (2l-l ) 0 

<  -2 - 

~  2 


+  C(N  ) 
o 

-  C(N  ) 
o 


accept 
accept  Hk_1 


(6.25) 
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From  (6.25),  it  is  clear  that  the  local  test  is  simply  to  determine  the 


region  of  the  observation  space  in  which  T  ,  (xVT  )  falls  and  decide 

N  — N 
o  o 

H,  (u.  =  k)  if  T  (x  ,  )  e  R,  .  The  regions  R,  ;  k=0, 1, . . . ,M-1,  are  as 
k  1  N  —  N  k  k 

o  o 

summarized  below: 

R,  =  [N  k0  -  c  ,  N  k9  +  c]  ;  k-1, 2, . . . ,M-1 

k  o  o 

Rq  =  (-oo,  c]  (6.26) 

R  =  ( (M-l)  N  0  -  C  ,  “) 

M-l  O 

I,  ,  =  (kN  9  +  c  ,  (k+1) N  9  -  c)  ;  k  =  l,2,...,M-2 

x,k+l  o  o 


where  c  =  N  9/2  -  c  (N  )  is  a  real  number  whose  value  depends  on  N  and 
o  o  o 

satisfies  c  <  N  9/2.  We  define  the  indecision  region  I  to  be  the  union 
o 

of  all  indecision  regions  I  ;  k  =■=  0,1, ...,M-2.  Let  e(j,E)  be  the 

f  )C  i  X 

probability  that  (xn  )  e  R.  when  is  the  true  hypothesis,  i.e. 

o  o  3 


e(j,E)  =  Pr{ux  =  j/H£}  j,E  = 


(6.27) 


For  small  error  probabilities,  it  follows  that 


e(j,j)  »  e ( j , j— 1)  =  e(j-l,j)  »  e(j,j-2)  =  e(j-2,j)  »  ...  (6.28) 


which  states  that  if  an  error  occurs,  it  is  most  likely  to  occur  between 
adjacent  hypotheses.  A  straightforward  generalization  of  the  fusion  rule 
given  by  (6.10)  leads  to  the  following  symmetric  fusion  rule 


u 

g 


A 


g(ui'u2) 


u^=U2=k  ,  stop  and  decide 

otherwise,  discard  all  previous 
stages  and  continue. 


(6.29) 
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From  (6.29),  it  follows  that  the  probability  of  error  when  is  true  is 
given  by 


M-l  M-l 

Pr  (e/H  )  =  Y  e2  ( j, k) /  Y  e2(j,k)  k  -  0, 1, . . . , M-l  (6.30) 

3^>  '  j-=0 

M-l 

where  Y  e2(j,k)  is  the  probability  of  obtaining  two  idetical  local 

3=0 

decisions  at  any  stage.  Neglecting  all  errors  between  nonadjacent 
hypotheses,  we  can  approximate  Pr(e/H^)  for  a  middle  hypothesis  by 


Pr (e/H^)  -  2e2  (k-1, k) /{e2  (k, k)  +  2e2  (k-1, k) } ,  k=l, 2, . . . ,M-2  (6.31) 


and  for  the  end  hypotheses  H.  and  H  ,  we  can  write 

U  M-l 


Pr (e/H  )  -  Pr(e/HM  .)  *  e2  (1, 0) /{e2  (0, 0)  +e2(l,0)}  (6.32) 

0  M-l 


Equations  (6.31)  and  (6.32),  show  that  the  error  probability  for  a  middle 
hypothesis  is  approximately  twice  that  of  a  boundary  hypothesis  which  is 
in  agreement  with  the  results  of  centralized  detection  under  the  same 
criterion  [21] . 

The  test  procedure  described  aDove  is  clearly  memoryless  because  the 
local  decisions  are  ignored  if  a  hypothesis  decision  is  r.ot  reached.  De¬ 
noting  by  PT(k)  the  probability  of  global  test  termination  at  any  stage 
provided  that  it  has  not  terminated  at  any  prior  stage  and  is  the  true 
hypothesis.  It  follows  that 

M-l 

PT(k)  =  H  e2(j/>0  » 

3=0 

which  can  be  approximated  as  follows: 
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V10  s 


e2(k,k)  +  2e2  (k-1,  k)  ,  k-1,2,  .  .  .,M-2 

e2  (0,0)  +  e2  (1,0)  ,  k-0  (6.33) 

e2  (M-l, M-l)  +  e2  (M-2 , M-l)  ,  k=M-l 


From  (6.33),  the  average  number  of  observations  ASN(k)  required  for  termi¬ 
nation  when  H,  is  true  can  be  easily  derived.  The  derivation  is  similar 
k 

to  the  binary  hypothesis  case  and  the  result  is  given  by 

ASN(k)  -  NQ/PT(k)  (6.34) 

The  average  error  probability  Pr(e)  and  the  average  sample  number  ASN 
can  be  derived  by  averaging  Pr(e/H^)  and  ASN(k)  respectively  to  obtain 

M-l  M-l 

Pr(e)  -  T”  Pr{H,  is  true}  •  Pr(e/H,)  =*  —  V”  Pr(e/H  )  (6.35) 

L K  KM  i —  K 

k-0  k=0 

M-l  N  M-l 

ASN  *  V"  Pr{ H  is  true}  •  ASN(k)  *  —7  Y~  r — 77—  (6.36) 

Z_  k  M  P  (k) 

k=0  k-0  T 

As  in  the  binary  hypothesis  case, it  seems  that  we  cannot  find  the 
optimal  package  size  N*  analytically  and  numerical  techniques  must  be 
considered.  However,  it  can  be  shown  that  PT ( k)  is  a  monotonically 
increasing  function  of  Nq.  Therefore,  given  the  value  of  Pr(e),  we  can 
find  the  optimal  package  size  N*  by  keeping  Pr(e)  fixed  while  minimizing 
ASN  over  the  set  of  possible  package  sizes.  As  is  evident  from  the 
numerical  results  obtained,  the  optimal  package  size  is  less  than  N  / 4 
for  Pr(e)  5  10  Moreover,  the  ASN  does  not  change  considerably  as  a 

result  of  small  variations  in  the  package  size,  i.e.,  the  ASN  is  a 
relatively  flat  concave  function  of  Nq. 

6.4.  Truncation  of  the  Test  Procedure 

As  was  pointed  out  in  Sections  6.2  and  6.3,  the  number  of  observation 
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packages  necessary  for  the  termination  of  the  global  test  is  a  geometri¬ 
cally  distributed  random  variable.  Therefore,  the  GDM  will  reach  a  deci¬ 
sion  in  a  finite  number  of  trials  (packages) .  However,  the  GDM  may  re¬ 
quire  an  excessively  large  number  of  trials  before  a  decision  is  reached. 
In  other  words,  the  number  of  trials  required  is  an  unbounded  random 
variable.  To  avoid  such  undesirable  situation,  a  reasonable  truncation 
scheme  is  usually  necessary.  The  analysis  of  the  truncation  problem 
associated  with  Wald's  SPRT  is  difficult  [31-33]  and  either  approximate 
expressions  or  bounds  are  available  for  both  the  error  probabilities  and 
the  ASN.  The  truncation  problem  for  the  generalized  SPRT  in  [25]  is  even 
more  difficult  and  no  truncation  analysis  is  reported.  For  the 
memoryless  sequential  procedures  under  consideration,  the  exact  distribu¬ 
tion  of  the  global  test  duration  given  the  hypotheses  is  known  and 
simple.  Therefore,  the  resulting  truncation  problem  is  simpler  to 
analyze.  In  this  section,  we  propose  a  truncation  scheme  and  analyze  its 
performance  for  the  decentralized  binary  hypothesis  testing  problem.  The 
M  hypotheses  case  can  be  treated  similarly. 

The  proposed  truncation  scheme  can  be  described  as  follows:  At  any 

stage  k(k  <,  m) ,  a  package  of  N  observations  is  taken  at  each  LD.  The 

o 

resulting  local  sufficient  statistics  T  (x„,  )  and  T  (y  )  are  com- 

N  — N  N  N 

0  0  O  O 

pared  with  the  thresholds  A*  and  B*,  where  A*  >  A  and  B*  <  B  (see 
definitions  of  A  and  B  in  equation  (6.6)).  The  monotonicity  of  the  LR 
function  of  the  sufficient  statistics  guarantees  lower  error  proba¬ 
bilities  than  a  and  (3.  The  global  test  is  carried  out  sequentially  in 
the  usual  manner.  If  no  decision  is  reached  up  to  the  mth  stage,  each 
local  detector  is  allowed  to  take  one  more  package  of  Nq  observations. 

The  choice  of  equal  package  3izes  at  all  stages  is  necessary  only  to 
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simplify  the  implementation  of  the  test.  At  the  truncation  stage,  the 
local  test  statistics  are  computed  and  single  threshold  tests  are 


performed  as  follows: 


(6.37) 


u  r1 

(2m  >  i  t 

N  — ; N  U  =0 

O  o  ui  u 


u2=l 

tm  >  i  t 
N  N  „  =q 
o  o  uo  u 


where  u_.  *  i  means  that  the  jth,  j=l,2,  local  decision  is  in  favor  of  H^, 
i*0,l.  The  fusion  rules  possible  at  the  truncation  stage  are  AND  and  OR 
rules.  The  symmetry  of  the  problem  implies  that  these  fusion  rules  have 
identical  performance  at  aT  and  p^,  where  aT  and  PT  are  the  error  proba¬ 
bilities  at  the  truncation  stage  with  aT  =  PT  >  a  =  P-  We  assume  the  AND 
fusion  rule  here,  therefore,  the  global  decision  u  is  H  if  and  only  if 
u^  m  u^  *  1  and  u^  is  otherwise.  Consequently,  we  may  write 


“t  =  ai  *  a2  3  ae  =  {1  "  *  (  } 2 


Vn  a 
o 


1-PT  -  (l-pj)  •  a-p2)  -  a-Pt.!  - 

O 


Equating  aT  and  PT,  it  follows  that  the  threshold  t  in  (6.37)  is  the 


unique  solution  of  the  equation 


N  0  -  t 

{1  -  <t>  )}2  =  1  -  (~^= - ) 

VN  O  \N  O 

O  O 


(6.38) 


Similarly,  in  the  case  of  an  OR  fusion  rule,  it  can  be  easily  shown  that 
the  threshold  t'  is  the  unique  solution  of  the  equation 
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(6.39) 


1 


&  <-£r-> 
VN  CT 
o 


n  e  -  t' 

{1  -  4)  (-2= - )}2 

NN  O 
O 


Let  Vn  0-t'  =  t,  we  obtain 
o 


N  0  -  t 

1  -  &  (-2= - )  =  {1  -  4>  (-— — )}2 

VN  O  VNn  O 

O  ° 


(6.40) 


which  is  the  3ame  as  equation  (6.38).  Therefore,  we  conclude  that  the 
performance  of  the  AND  and  OR  fusion  rules  is  the  same  and  that  their 
thresholds  are  related  by 

t  =  Vn~0  -  t  (6.41) 

o  o  a 

where  t  and  t  are  the  thresholds  associated  with  the  OR  and  AND  fusion 
o  a 

rules  at  aT  *  P^  respectively. 

The  expressions  of  the  global  error  probabilities  a  and  P  for  the  pro¬ 
posed  truncation  scheme  are  obtained  by  averaging  their  values  at  all 
stages  as  follows: 


a  =  P  =  a* 


Pr{ T  £  mN  /H  }  +  a 
g  o  0  T 


Pr{ T  >  mN  /H.}  (6.42) 

g  o  Cr 


where  T^  denotes  the  global  test  duration, 
written  as 


Equation  (6.42)  can  be 


a  -  f 

i  -  a*[l  - 

<l-p*2-p*2)m]  +  a?  [l-p*2-pj2]m 

(6.43a) 

a  «  p 

-  a*  +  (aT-a*)  +  [i-P*2-P*2]m 

(6.43b) 

from  which  it  is  obvious  that  by  increasing  m,  the  coefficient  of  (a^-a*) 
decreases  monotonically .  Consequently,  there  exists  an  integer  m  such 
that  a  «  p  S  a  *  p. 
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The  ASN  of  the  truncated  sequential  procedure,  denoted  by  ASN  ,  has 
the  same  value  under  both  hypotheses  given  by 


ASN  =  N 


m 

T 

)t=0 


k[l  - 


Fo 


-  pj2)k" 


+  [1  - 


p* 

Fo 


P*2> 


(6.44) 


Simplifying  equation  (6.44),  we  obtain 


1  -  (1 


ASN 


N 


-P*2)m 


(P? 


+  PJ'  ) 


(6.45) 


T 

where  m  is  the  truncation  stage.  From  (6.45),  it  is  evident  that  ASN 

depends  on  the  choice  of  N^  as  well  as  the  thresholds  A*  and  B* .  The 

optimization  of  the  package  size  N^  for  a  given  A*  and  B*  is  simple  and 

can  be  done  in  the  same  way  used  for  the  untruncated  test.  On  the  other 

hand,  if  we  fix  the  package  size  Nq,  then  an  optimal  choice  of  A*  and  B* 

is  not  simple  due  to  the  continuous  nature  of  the  variables  A*  and  B*. 

T 

The  minimization  of  ASN  requires  an  optimal  choice  of  both  N^  and  the 

thresholds  A*  and  B*,  which  is  not  easy  to  accomplish.  However,  it 

should  be  emphasized  that  the  superiority  of  the  sequential  test 

procedure  over  the  optimal  FSS  test  procedure  implies  that  the  truncated 

sequential  procedure  must  have  a  higher  average  sample  number  than  its 

T 

untruncated  counterpart,  i.e.,  ASN  >  ASN.  This  is  so  because  the 

truncated  te3t  is  actually  a  mixture  of  both  the  sequential  and  FSS  test 

procedures.  Consequently,  a  reasonable  truncated  sequential  procedure  is 
T 

obtained  when  ASN  -  ASN  as  demonstrated  in  the  numerical  results. 


6.5.  Numerical  Results 

In  this  section,  we  present  some  of  the  numerical  results  obtained. 
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The  observation  model  for  the  binary  hypothesis  case  is  as  given  by  (6.1) 
with  0  =  0.2  and  O2  =  1.  We  denote  by  N„_,  and  N  the  number  of 
observations  required  by  the  optimal  fixed-sample-size  distributed  system 
consisting  of  one  and  two  local  detectors  respectively.  Similarly,  we 
denote  by  ASN^  and  ASN^  the  average  number  of  observations  required  for 
the  termination  of  the  global  test  when  one  and  two  local  detectors  are 
employed  respectively.  The  numerical  results  obtained  for  different 
values  of  error  probabilities  are  summarized  in  Table  6.1.  These  results 
indicate  that  for  small  values  of  error  probabilities  (a=p) ,  our  distri¬ 
buted  sequential  procedure  is  far  more  efficient  than  the  distributed  FSS 
system  and  that  its  efficiency  (ASN  /N  )  improves  monotonically  as 
the  error  probabilities  decrease.  Moreover,  our  distributed  procedure 
requires  less  than  60%  of  the  ASN  required  by  the  Lee-Thomas  [21]  MLGDS 
procedure . 

The  observation  model  for  the  M  hypotheses  case  is  as  given  by 
(6.23)  with  0  =  0.2,  a2  =  1,  and  M  -  10.  In  Table  6.2,  E(n)  denotes 
the  average  sample  number  of  the  sequential  test  in  [25] .  The 
numerical  results  obtained  for  some  values  of  the  error  probability  are 
given  in  Table  6.2.  From  these  results,  it  is  evident  that  the 
increase  in  the  ASN  due  to  both  grouping  of  the  observations  and  the 
memroyless  nature  of  the  test  procedure  is  less  than  25%  of  E(n). 
Moreover,  the  distributed  system  requires  approximately  60%  of  the 
average  number  of  observations  (ASN^)  required  for  the  termination  of 
the  single  sensor  scheme  in  [27],  which  is  quite  an  interesting  result 
knowing  that  the  centralized  processing  necessarily  requires  50%  of 
ASN1 . 

Finally,  the  proposed  truncation  scheme  is  applied  to  the  binary 
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hypothesis  case  with  the  same  parameters  used  in  obtaining  Tables  6.1. 

The  optimization  is  performed  by  first  choosing  slightly  smaller  values 

of  a*  and  J3*  {a*  =  (3*)  than  the  specified  values  of  a  and  [3  (a=(3)  .  The 

package  size  is  then  varied  while  adjusting  the  thresholds  A*  and  B* 

T 

such  that  a*  and  (3*  are  fixed  for  all  values  of  Nq.  The  resulting  ASN^ 

is  calculated  for  all  values  of  N  and  the  minimization  is  obtained  by 

o 

T 

selecting  the  package  size  that  yields  the  minimum  of  ASN^ .  In  Table 
6.3,  we  give  the  values  of  a*,  a :  ,  a,  the  termination  stage,  ASN^,  and 
ASN^  for  some  values  of  a  and  (3.  Comparing  the  values  of  ASN^  and  ASN2' 

we  observe  that  they  are  almost  equal  for  all  values  of  a  and  (3  (a=|3)  . 

At  —  6 

The  maximum  difference  8  =  ASN2  -  ASN2  is  obtained  for  a=|3=10  and  is 

equal  to  2.2  observations,  which  corresponds  to  a  percentage  increase 

(8/ASN2)  X  100%  2  0.37%. 
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Table  6.1.  Results  for  the  binary  hypotheses  case,  9=0.2,  a2=l. 


a=p 

NT  M 

FSS1  FSS2 

ASNX 

asn2 

*SVSFSS1 

asn2 

/NFSS2  *sVfiS!,l 

io"4 

1383  967 

664.2 

405.7 

0.48 

0.42 

0.611 

10-6 

2259  1593 

993.5 

598 

0.44 

0.375 

0.602 

10-8 

3149  2231 

1310.5 

78G.1 

0.416 

0.35 

0.595 

io'10 

4046  2875 

1619.8 

955.3 

0.4 

0.332 

0.59 

10~12 

4948  3529 

1922.7 

1125.8 

0.39 

0.32 

0.585 

io'14 

5853  4183 

2220.8 

1295.1 

0.38 

0.31 

0.583 

i 

Table  6 

.2.  Results 

for  the 

M  hypotheses  case, 

M=10, 

9=0 . 2  and  C 

r2=l. 

Pr  (e) 

nfssi 

ASN 

ASN2 

E(n)  ASVNFSS1  ASNi/E(n) 

asn2/asn1 

io-4 

1494 

823 

511.6 

666.5 

0.551 

1.235 

0.622 

io-6 

2373  11 

62.1 

711.1 

934.3 

0.49 

1.244 

0.612 

io-8 

3264  1489.8 

899.2 

2296.4 

0.546 

1.245 

0.604 

io-10 

4162  1806.0 

1080.7 

1454.8 

0.434 

1.241 

0.598 

io'12 

5064  2115.8 

1255.8 

l^iO. 6 

0.418 

1.237 

0.594 

io-14 

5969  2420.3 

2427.2 

1964.4 

0.405 

1.232 

0.589 
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Table  6.3.  Results  of  the  proposed  truncation  scheme,  M=2,0=O.2 


and  a2 =1 . 


a*=f3* 

a  =  8 

T  T 

a  =  p 

Truncation 
Stage  m 

asn2 

T 

asn2 

9.5X10-5 

1.755X10" 2 

0.996X10"4 

6 

405.7 

406.8 

9.473X10- 7 

4 . 05xl0~3 

0. 988xl0~6 

7 

598 

600.2 

9.591X10”9 

1.22X10"3 

0.99X10"8 

9 

780.1 

781.5 

9.573X10-11 

3.55X10"4 

0.967X10-10 

11 

955.3 

95"1 . 1 

9.534X10-13 

1.08xl0~4 

0.971X10"12 

12 

1125.8 

1128.2 

9.478X10-15 

3.83X10"5 

0.966X10"14 

14 

1295.1 

1296.8 

6.6  Discussion 

In  this  chapter,  we  have  extended  the  results  of  Lee  and  Thomas  [21] 
to  a  distributed  system  consisting  of  two  local  detectors.  The  resulting 
sequential  procedure  exhibits  the  simplicity  of  the  FSS  test  procedures 
while  maintaining  the  performance  superiority  of  the  sequential  pro¬ 
cedures  as  is  clear  from  Table  6.1.  In  addition,  we  have  considered  the 
case  of  M  hypotheses  without  increasing  the  complexity  of  the  test  over 
the  binary  hypothesis  case.  The  numerical  results  obtained  for  M 
hypotheses  show  clearly  a  substantial  saving  in  the  average  sample  number 
at  the  local  level  as  a  result  of  decentralization.  Consequently,  on  the 
average,  the  decision  (detection)  process  is  much  faster.  The  numerical 
results  obtained  for  the  truncation  scheme  indicate  that  its  effect  on  the 
average  sample  number  i3  very  insignificant.  Therefore,  the  test  is 
practically  realizable  in  the  3ense  that  its  duration  is  bounded.  Fin- 


110 


ally,  it  is  worth  mentioning  that  the  distributed  system  can  be  gener¬ 
alized  to  have  more  than  two  local  detectors  and  different  fusion  rules. 


Ill 


CHAPTER  SEVEN 


NONPARAMETRIC  SEQUENTIAL  DETECTION  BASED 
ON  MULTI SENSOR  DATA 

7.1.  Introduction 

Several  attempts  have  been  made  in  the  literature  to  combine  the 
desirable  properties  of  nonparametric  fixed-sample-size  (FSS)  tests  and 
the  advantages  of  sequential  tests.  The  Kassam-Thomas  dead  zone  limiter 
(DZL)  [35]  is  a  generalization  of  the  classical  FSS  nonparametric  sign 
detector  [36,37].  The  DZL  in  [35]  exhibits  performance  superiority  over 
the  FSS  sign  detector.  This  performance  gain  is  achieved  at  the  expense 
of  additional  implementation  complexity.  A  read-only-memory  (ROM)  is 
required  for  the  implementation  of  the  test  in  [35]  .  For  small  values  of 
error  probabilities  a  and  p,  where  a  is  the  probability  of  error  of  the 
first  kind  and  P  is  the  probability  of  error  of  the  second  kind,  the  size 
of  the  ROM  required  is  ] arge  and  hence,  the  complexity  increases  a3  a  and 
p  decrease.  The  performance  of  the  DZL  in  [35]  has  been  further  improved 
by  Shin  and  Kassam  [22]  by  means  of  sequential  testing.  The  sequential 
test  in  [22]  called  conditional  sequential  DZL  is  similar  to  the 
sequential  probability  ratio  test  (SPRT)  of  Wald  [1],  and  it  requires  ROM 
accessing  and  a  two  threshold  test  after  each  individual  observation. 

This  greatly  increases  the  complexity  of  the  test.  This  added  complexity 
motivated  Tantaratana  and  Poor  [24]  to  consider  a  two-  stage  version  of 
the  conditional  sequential  DZL  test  in  [22]  to  reduce  the  implementation 
complexity  while  retaining  the  performance  superiority  of  the  sequential 
test3.  Tantaratana  [23]  considered  the  formulation  of  the  conditional 
sequential  DZL  in  [22]  as  a  simple  random  walk  between  two  fixed  absorb¬ 
ing  boundaries.  His  formulation  does  not  require  the  table  look-up  oper- 
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at  ion  and,  therefore,  the  implementation  of  the  test  is  much  simpler.  The 


restriction  of  the  random  walk  formulation  to  the  simple  case  in  which  the 
positive  step  is  equal  to  the  magnitude  of  the  negative  step  is  not 
suitable  and  leads  to  undesirable  increase  in  the  average  sample  numbers 
(ASN'S).  In  fact,  it  can  be  easily  shown  from  the  theory  of  random  walk 
[29]  that  for  small  number  of  states  as  in  [23],  a  truncation  is  neces¬ 
sary  to  limit  a  from  becoming  much  larger  than  its  prespecified  value. 
However,  the  truncation  scheme  in  [23]  is  based  on  the  number  of 
observations  whose  magnitude  is  larger  than  a  fixed  real  number. 

Therefore,  the  truncation  in  [23]  has  no  effect  on  limiting  the  actual 
number  of  observations  from  being  excessively  large.  Moreover,  it  should 
be  mentioned  that  if  no  truncation  is  employed,  then  the  simple  random 
walk  formulation  [29]  requires  a  large  number  of  states  thereby  leading  to 
a  substantial  increase  in  the  average  number  of  observtions  required  for 
the  test  termination  under  both  hypotheses. 

In  this  chapter,  we  show  that  the  conditional  sequential  DZL  in  [22] 
can  be  implemented  without  the  need  for  the  table  look-up  operation.  In 
other  words,  we  can  sequentially  test  against  two  fixed  thresholds  which 
can  be  chosen  prior  to  the  test .  The  threshold  design  is  the  same  as  that 
of  the  SPRT  [1],  In  Section  7.3,  we  study  the  formulation  of  the  test  as 
a  random  walk  without  a  restriction  on  the  steps  to  be  equal.  In  Section 
7.4,  we  generalize  the  nonparametric  sequential  sign  test  to  a  distributed 
system  consisting  of  two  local  sensors  and  a  global  decision  maker  as 
shown  in  Fig.  7.1.  in  Section  7.5,  we  derive  a  conditional  sequential  DZL 
detector  for  the  distributed  system  in  Fig.  7.1.  it  i3  shown  that  if  the 
observations  are  spatially  independent  and  have  identical  distributions, 
and  that  no  excess  over  the  thresholds  is  assumed,  the  resulting  tests 
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x  1 1  ,x12’ 


x21,x22’ 


require  half  the  number  of  observations  required  for  the  single  detector 
in  [22].  In  Section  7.6,  we  present  a  numerical  example  for  the 
nonparametric  sequential  sign  test.  Finally,  in  Section  7.7,  we  conclude 
the  chapter  with  a  discussion  of  the  results  obtained. 


7.2.  The  Nonparametric  Sequential  Sign  and  Conditional  Sign  Tests 

Consider  the  problem  of  detecting  a  constant  signal  9  corrupted  by  an 
additive  noise.  The  noise  pdf  is  assumed  to  be  symmetric,  continuous,  and 
with  zero  mean.  Let  X  ,  X^,...  be  a  sequence  of  iid  ramdom  variables 
which  represent  the  successive  observations.  This  detection  problem  is  a 
problem  of  testing  the  null  hypothesis  H0  against  the  location  alternative 
H  ,  where 

Hq  :  Xi  .  f(x) 

(7.1) 

Hi  :  Xj_  -  f  (x-0)  ,  9  >  0 

with  f(x)  =  f(-x).  We  denote  by  x^,  x^  ,  ...  a  realization  of  the  obser- 

A 

vations  X  ,  X  , ...  .  Let  x  =  [x,  x.  ...  x  ]  be  a  vector  of  observations 
1  l  ~n  12  n 

of  3ize  n. 

The  sequential  sign  test  in  [36]  is  a  SPRT  performed  on  the  sign  of 
the  observations.  Each  observation  x^  (i  S  1)  is  passed  through  a  hard 
limiter  whose  output  is  given  by 


Let 


if  x.  >  0 
i 

(7.2) 

if  x.  <  0 

i 


p0  =  Pr(Yi  =  f  1/9)  =  1  -  F  ( -9 ) 


(7.3a) 
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F(-8) 


(7.3b) 


q0  =  Pr(Yi  -  -  1/0)  - 


where  F(x) 


f(t)dt.  It  follows  that  for  any  pdf  f  (x)  as  defined 


earlier,  the  hypothesis  testing  problem  is  now  given  by 

H0  :  Yi  ~  B<P0  =  *■'*) 


(7.4) 


H1  :  Yi  »  B(p0  >  1/2) 

where  B(p)  is  a  Bernoulli  random  variable  with  probability  of  success 
(Y^=+l)  equal  to  p,  and  probability  of  failure  (Y^=  -1)  equal  to  l-p=q. 
When  Hq (0  =  0)  is  the  true  hypothesis,  the  distribution  of  the  quantized 
observations  is  invariant  in  the  class  C  of  continuous,  symmetric,  and 
zero  mean  pdf's.  It  follows  that  a  SPRT  based  on  Z^'s  is  nonparametric 
[36],  in  the  sense  that  it  has  a  fixed  a  for  any  X  EC.  Following  Wald 
[1],  the  SPRT  based  on  the  Y^'s  is  as  follows. 

n  f  t  log  (l~P)/a  =  log  t^  ,  decide 

f  2.  <  log  |5/(l-a)  =  log  t£  ,  decide  HQ  (7.5) 

1  ^  otherwise,  continue. 

f (y./H) 

1  1 

where  Z^  =  log  — — yy  ,  f(y^/H,)  is  the  probability  density  function  of 

the  discrete  random  variable  Y^  when  the  hypothesis  H^,j*0,l/  is  true,  and 
t  and  t„  are  the  upper  and  lower  thresholds  respectively.  Let  n  be  the 

U  £  p 

number  of  positive  observations  at  stage  n,  and  let  pQ  be  an  estimate  of 
pQ  which  we  use  in  designing  the  test.  The  random  variable  Z,  in  (7.5) 

V  1 

assumes  the  value  log (p0/ (1/2) )  when  Y^=+l  and  the  value  log ( (l-p0) / (1/2) ) 
when  y^  =  -1.  Therefore,  equation  (7.5)  can  be  written  explicitly  as 


!>  log  t  ,  decide  H, 
u  1 

<,  log  t^  ,  decide  (7.6) 

otherwise,  continue. 
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where  n  is  the  number  of  observations  such  that  Y.  is  positive  and  n  is 
P  1 

the  total  number  of  observations  taken  so  far.  Neglecting  any  excess 
over  the  test  thresholds  upon  termination.  The  average  sample  number 
(ASN0) ,  defined  as  the  average  number  of  observations  required  for 

\J 

termination  when  9  is  the  true  parameter  is  approximated  [1]  by 


ASN0 


L (9)  log  t-  +  tl-L(G) ]log  t 
^  ^ 

—  E[Z  /0J 


(7.7) 


where  L(0)  is  the  operating  characteristic  (OC)  function  of  the  test,  and 
1-L (0 )  is  the  power  function  of  the  test.  To  obtain  L(0),  we  solve  the 
parametric  equation 


f (y./H  ).h(0) 

TTTTvJ  “V91  ’  1  <1-8» 

i  0 

where  the  summation  is  over  all  possible  values  of  y^,  and  f(y^/0)  is  the 
pdf  when  0  is  the  true  parameter.  Wald  [1]  has  shown  that  h(0)  is  unique 
and  that  L(0)  is  given  in  terms  of  h (0 )  by 


L(0) 


th(0) 

u 

t  h(9) 
u 


.  h(0) 

l 


(7.9) 


In  our  particular  case,  the  random  variable  Y^  assumes  the  values  ±  1 
only.  Therefore,  we  can  write  equation  (7.8)  as  follows: 

pe<2pe)h<e)  +  qe(2(l-p0))h(0)=  1  (7.10) 

where  p0  =  Pr(Y^  -  1/0]  and  q^  -  l-p0  From  the  definition  of  Z^  in  (7.5) 


it  is  clear  that  E(Z./0)]  is  given  by 


(7.11) 


E (Z^/0) ]  =  pQ  •  log  2£q  +  log  2(l-pQ) 
when  Hq  is  true  (0=0),  it  follows  that  p^  =  =  1/2.  Therefore,  h(0)=l 

and  L(0)=l-a.  Similarly,  when  Pg  =  Pg  (our  estimated  value),  it  follows 
that  h(8)  =  -1  and  L(0)  =  (J,  which  means  that  the  power  of  the  test  is 
1-p  as  required. 

The  conditional  sequential  D2L  detector  is  similar  to  the  sequential 
sign  detector.  The  only  difference  is  that  it  employs  the  following  DZL 
quantizer 

1+1,  if  >  c 

0,  if  -c  S  x,  <  c  (7.12) 

1 

-1,  elsewhere 

Let 

p„  =  Pr(Y.  »  +  1/0)  -  1  -  F (c-0)  (7.13a) 

0  l 

qg  =  Pr(Yi  =  -  1/8)  =  F  (-C-0)  (7.13b) 

rQ  =  Pr^  =  0/0)  =  1  -  Pg  -  qg  (7.13c) 

When  Hq  is  tr  e,  (0=0),  it  follows  that  pg  =  q^  for  all  X  £C.  However, 
the  actual  value  of  Pg  depends  on  f (x)  as  given  by  (7.13a) .  In  order  to 
obtain  a  nonparametric  test  we  must  base  our  test  on  a  random  variable 
whose  distribution  is  invariant  to  f (x)  when  Hg  is  true.  Let  us  define 
as  follows 

W  =  sgn(Y./Y.  =  ±  1)  (7.14) 

li 

Clearly,  is  a  random  variable  defined  on  the  set  of  observations  such 

that  Y.  =  ±  1.  Given  that  an  observation  Y.  is  in  that  set,  the  con- 
i  i 

ditional  distribution  of  given  the  hypothesis  is  as  follows 
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Ho  :  wi  ~  B(po  =  1/2> 

H  :  W  „  B(p^  >  1/2) 


(7.15) 


where  p^=pg/ (pg+q^) .  When  HQ  is  true  (8=0),  it  is  clear  that  p  / (p0+qQ)  = 

1/2  and  consequently,  the  test  based  on  is  nonparametric .  The  SPRT 

based  on  w^'s  is  similar  to  the  sign  test,  the  only  difference  is  that  when 

Y^  =  0,  the  observation  is  ignored.  Let  m  be  the  number  of  observations 

of  W.,  i.e.,  m  is  the  number  of  observations  such  that  Y.  =  ±  1.  It 
i  i 

follows  that  the  ASNg  expressed  in  terms  of  m  is  given  by  (1] 

L (0 )  log  t  +  [1-L (8) )  log  t 

ASN0  (m)  =  - - *  (7.16) 

f (w  /H  ) 

where  Z.  =  log  — — ■■  ,  and  L(6)  is  computed  using  (7.7)  and  (7.9) 
i  r  (w. /H. ) 

l  0 

after  replacing  y.  with  w. .  Observe  that  the  observations  Y.  =  0  have  no 
li  i 

effect  on  the  power  of  the  test  simply  because  they  are  not  included  in 

the  computation.  However,  those  observations  have  a  major  effect  on  the 

ASNg  when  computed  on  the  actual  (unconditional)  observations.  Observe 

that  ASN« (m)  is  the  average  sample  number  conditioned  on  the  event  Y.=±l. 
o  i 

To  derive  the  actual  average  sample  number  ASNQ,  we  observe  that  the 
event  has  a  probability  of  occurrence  equal  to  (Pg  +  qg)  for  any 

individual  observation.  Therefore,  for  any  vlue  of  ASNQ (m) ,  say  k,  the 

V 

distribution  of  the  actual  number  of  observations  N  is  the  Pascal  (nega¬ 
tive  binomial)  distribution  as  given  by 


Pr (N=n/ASN„ (m)=k) 
tt 


n-1 

k 


1 


(p8+q8)k  (1”pe-qe)n  kf  n=k'k+1-- 


(7.17) 
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The  actual  average  sample  number  ASNg  is  obtained  by  averaging  over  all 


possible  values  of  N.  The  result  is  known  and  is  given  by 


ASN0 


ASNq (m) 

(P0  +  V 


(7.18) 


7.3.  Random  Walk  Formulation  of  the  Tests 

Both  the  nonparametric  sequential  sign  and  conditional  sign  tests  de¬ 
scribed  in  Section  7.2  employ  a  two-valued  discrete  random  variable  for 
testing  as  is  evident  from  (7.4)  and  (7.15).  It  follows  that  these 
sequential  tests  can  be  easily  implemented  as  a  random  walk  on  a  finite 
number  of  states  as  described  in  Chapter  Two.  Tantaratana  (24]  con¬ 
sidered  the  case  in  which  the  nonparametric  sequential  conditional  sign 
test  is  formulated  as  a  simple  random  walk  on  the  states 

1  N 

as  shown  in  Fig.  7.2.,  The  states  S.  and  are  absorbing  states  with 

U  N 

associated  decisions  HQ  and  respectively.  The  states  S^, 

are  all  transient  with  an  associated  decision  of  continue  in  an  analogy 

with  Wald's  SPRT  [1].  Initially,  the  process  3tarts  at  state  S  0<Z<N, 

z 

and  it  goes  one  step  higher  (A+  =  1)  if  the  current  observation  is  posi¬ 
tive  and  one  step  lower  (A-  =  1)  if  the  observation  is  negative.  As  soon 

as  the  process  reaches  one  of  the  absorbing  states  S.  or  S, .  the  test 

0  N 

terminates  with  the  acceptance  of  the  hypothesis  associated  with  that 
state.  The  results  for  equal  step  random  walk  [29]  can  be  used  in  con¬ 
junction  with  the  observation  model  of  (7.4)  and  (7.15).  It  is  well 

known  [29]  that  for  a  process  starting  at  S  ,  the  probability  of  absorp- 

z 

tion  in  state  is  given  by 
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Decide  Hj 


(7.19) 


u 


z 


(q/p) 2  -  (q/p)N 
1  -  (q/p)N 


where  q  =  Pr{A-}  and  p  =  Pr{A+} .  When  is  true,  we  have  q^  =  p^  =  1/2, 
therefore,  we  obtain 


a  -  l 


lim  u 
q/p->l  z 


Z 

N 


(7.20) 


On  the  other  hand,  when  is 
estimated  (nominal)  values  q~ 


true,  we  substitute  for  q  and  p  their 
and  Pg  from  (7.6),  to  obtain 


(VV2-(Vg9)N 

1  -  (VP0)N 


(7.21) 


The  random  walk  formulation  of  the  test  requires  the  specification  of  both 

Z  and  N  such  that  a  and  P  are  satisfied  at  the  nominal  parameter  values. 

However,  it  is  obvious  from  (7.20)  that  for  Z-l,  we  must  have  N-a  1, 

therefore,  the  number  of  states  necessary  to  satisfy  a  is  extremely  large 

for  small  values  of  a.  On  the  other  hand,  for  sufficiently  large  N,  we 

N 

can  neglect  the  term  (q_/p.)  in  equation  (7.21)  to  obtain 


P  2  <VPe>Z  (7,22) 

It  follows  from  (7.22)  that  Z  -  log  (p  S /log (q^/^g) ,  and  thus,  Z  must  be 
greater  than  one,  in  general,  yielding  a  further  increase  in  the  number 
(N+l)  of  required  states.  Consequently,  the  average  sample  number,  which 
is  directly  related  to  N,  is  very  large.  To  overcome  this  highly  un¬ 
desirable  property  while  still  maintaining  implementation  and  analysis 
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simplicity,  we  propose  to  allow  the  magnitude  of  the  negative  step  to  be 

tv/ice  that  of  the  positive  step,  i.e..  A-  =  2,  and  A+  =  1  as  shown  in  Fig. 

7.3.  According  to  Fig.  7.3,  the  states  and  are  absorbing  states  and 

associated  with  H.  decision  while  S  is  an  absorbing  state  associated  with 
0  N 

decision. 

Starting  the  process  in  state  S^,  1  <  j  <  N,  we  can  express  the 

probability  of  absorption  in  state  as  follows: 

N 


u  . 
3 


PUj+l  +  **j-2 


(7.23) 


where  u.,  0  <  j  <  N,  is  the  probability  of  absorption  in  state  St  start - 
3  N 

ing  from  the  state  S_.  and  p(q)  is  the  probability  of  positive  (negative) 
step.  Using  the  method  of  particular  solutions,  we  obtain 


u.  =  A  +  B (x  ) 3  +  C (x  ) 3 
3  1  2 

u  =  1  (7.24) 

N 

u  =  u  =0 
0  1 


where  A, B,  and  C  are  constants  to  be  determined  from  the  boundary  condi¬ 
tions  u.,  u, ,  and  u  .  while  x,  and  x„  are  the  roots  of  the  second  order 
U  1  N  12 

homogeneous  equation 

px2  -  qx  -  q  =  0  (7.25) 

Solving  (7.25),  we  obtain  x^= (q  +  Vq2  +  4pq) /2p  and  x^=(q  -  Vq2  +  4pq)/2p. 
Substituting  the  boundary  conditions  in  (7.24),  we  obtain  for  j=Z  that 
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Fig.  7.3:  An  (Nrl)  state  random  walk  process  with  A+  =1  and  A  =2. 
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(7.26) 


uz 


<x  -x  )/(l-x  >  -  (  (l-x2>  /  (l-x1>  )  x*  +  x2 
<Xi_X2> / (1~:  i>  "  < <1-x2) ^ (1"X1) )  X1  +  X2 


Let  Z, . ,  i=0,l,N,  denote  the  probability  of  absorption  in  the  state 
starting  from  the  state  Sz .  Therefore,  I;  is  equal  to  uz  as  given  by 
(7.26).  The  probability  of  absorption  in  the  state  is  obtained  by 
solving  the  following  stochastic  difference  equation 


u.  =  Puj+1  +  qu._2 

uo  =  1 


(7.27) 


u  =  u  =0 
1  N 

whose  solution  is  similar  to  that  of  (7.23)  and  the  result  is  given  by 


X2<X1  1_X2  1)~((x2>/<x1))  <1_x2  1)/(1-x^  1>x^  +  x2 
^  x2<xi"1_X2~1)/<1_Xl"1)'(<X2)/(Xl)) (1_X2~1)/((1_X1-1>  +1 


Because  the  process  will  ultimately  be  absorbed  in  one  of  the  absorbing 
3tates,  it  follows  that 


■  1  -  *0  - 


(7.29) 


The  average  number  of  steps  until  absorption  when  9  is  the  actual 
parameter  is  denoted  by  ASN0  and  is  given  by  [30]  by 


ASNe 


I  %  .  (i-Z) 

II  _ 

?eA+  -  V 


I  Z, .  (i-Z) 

II  _ 

3P0  -  2 


1=0,  1,  N 


(7.30) 


where  pg  and  q^  are  as  defined  in  (7.13) 

The  number  of  states  N+l  and  the  starting  point  Z  must  be  chosen  such 

that  when  H.  is  true  (p„  =  1/2),  we  have  u„  £  a  and  when  H  is  true  (p„  = 
0  U  N  1  U 

j5Q),  we  have  t  >  1-G.  Although  we  have  so  far  considered  a  single 
u  N 
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detector  case,  our  results  can  be  extended  to  the  case  of  two  detectors 
in  a  straightforward  manner.  The  only  problem  encountered  is  that  the 
roots  of  the  stochastic  difference  equation  governing  the  random  walk  can 
not  be  obtained  analytically  and,  therefore,  numerical  solutions  must  be 
considered.  Equation  (7.17)  shows  that  the  actual  ASNq  is  a  function  of 
the  pdf  of  the  observations  and  the  parameter  of  the  DZL,  which  can  be  de¬ 
signed  to  satisfy  some  criterion  of  optimality  [22]  without  affecting  the 
nonparametric  property  of  the  test.  It  should  be  emphasized  that  the 
test  is  performed  sequentially  on  the  set  of  observations  |X^|  >  c  in 
exactly  the  same  way  the  sign  test  is  performed,  without  any  need  for  the 
table  look-up  as  it  was  originally  required  by  Shin  and  Kassam  in  |22], 


7.4.  Distributed  Nonparametric  Sequential  Sign  Test 

In  this  section,  we  generalize  the  nonparametric  sequential  sign  test 
in  [22]  to  a  two-sensor  distributed  environment.  The  system  is  as  shown 
in  Fig.  7.1.  It  is  assumed  that  the  observations  at  the  kth  sensor, 
k=*l,2,  are  iid  random  variables  with  a  continuous,  symmetric,  and  zero 
mean  pdf  f^(x).  The  observations  are  independent  from  3ensor  to  sensor, 
i.e.,  X^  is  independent  of  X^  for  all  i,£.  The  detection  problem  is, 
therefore,  a  hypothesis  testing  problem  in  which  the  hypothesis  is 
tested  versus  the  hypothesis  according  to  the  following  model 


H.  :  X.,  „  f  (x) 

0  ik  k 


k=l,  2 


H1  :  -  V-V 


(7.31) 


with  f^(x)  =  f^f-x),  anci  >  0'  k=l,2.  Each  local  sensor  quantizes  its 
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own  observation  into  two  levels  as  given  by  equation  (7.2),  and 
communicates  the  quantized  output  to  the  central  detector.  The  central 
detector  combines  the  quantized  observations  and  performs  the  SPRT.  From 
the  central  processor  ^oint  of  view,  the  hypothesis  testing  problem  in 
(7.31)  is  as  follows: 


H 


ik 


»  B (1/2) 


k=l,  2 


(7.32) 


Let 


Yik  ~  B(Pk> 


Pu  = 


Pr(Yik  =  +  1/6k) 


1  - 


V'V 


(7.33a) 


%  -  Pr(Yik  -  -  1/0k}  -  1  -  V'V  (7-33b) 

x 

where  F^U)  =  J  fk(t)dt.  Observe  that  for  all  X^eC,  the  distribution 

— oo 

Y.,  is  invariant  under  the  hypothesis  H„ .  Let  V.  be  an  observation 

ik  0  — i 

vector  with  Y^  and  Y ^  as  elements,  i.e.. 


V.  =  (Y ,  Y ,  „ ) 
-r  xl  i2 


(7.34) 


Let  f(v,/H.)  be  the  pdf  of  V.  when  H.,  j=0,l  is  true. 
-13  r  -1  3 


Then,  we  have 


f  (v./H.) 
-1  3 


f  (y^/H.) 
il  3 


3=0,1 


(7.35) 


It  should  be  emphasized  that  the  distribution  of  when  i3  true  is 
also  invariant  to  changes  in  the  pdf's  of  the  observations  X^,  k=l,2  as 
long  as  X^eC.  Consequently  the  SPRT  performed  on  is  nonparametric 
(constant  a).  Let  V  ,  v^, . . .  be  a  sequence  of  successive  central  obser- 
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vations.  The  SPRT  at  the  central  level  in  [1]  as  follows: 


n 

F 


z. 

i 


>  log  ,  decide 

<  log  ,  decide 

otherwise,  continue. 


(7.36) 


where 


Z .  =  log 

i 


[f (v  /H  t/f (v  /H  )  ] 


From  (7.36),  it  follows  that 


or, 


z . 
1 


>  1<,*(4IWV/£(WV1 

x=l 


z. 

1 


il 


+  Z 


i  2 


From  (7.37),  we  observe  that 


(7.37a) 


(7.37b) 


EtZi/ei'62J  =  EUil/0l]  +  E[Zi2/02] 


(7.38) 


Following  Wald  [1]  in  assuming  no  excess  over  the  thresholds  at 
termination,  the  ASN(9lf92)  is  given  by 


ASN(0  ,9  ) 


L(91#92)  log  t£  +  [1-L(91,92)]  log  tu 

E<VV02J 


(7.39) 


where  L(0^,8O)  is  the  OC  function,  which  is  now  a  function  of  0  and  02> 
To  obtain  we  need  to  solve  equation  (7.9)  for  h(9)  «  h(91,92) 

“  h.  However,  h  (0 1 , 9 2 )  *  h  is  obtained  as  the  unique  nonzero  solution 
of  equation  (7.8)  which  can  be  written  explicily  in  terms  of  the  four 
possible  values  of  Z^  as  follows: 


PlP2l<1^2]h  +  piq2E4pl^2]h  +  qiP2f4qlP2]h  +  qiq2[4qiq2]h  =  1  (7'40) 

_  A 

where  p^  and  q^  -  1-p  are  the  estimated  values  used  in  designing  the 

n 

steps  of  the  random  process  Z^  ,  while  p^  and  q^  *  1-p^  are  the 
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actual  parameters.  Once  is  computed,  we  can  solve  (7.9)  to 

obtain  From  (7.40)  we  observe  when  is  true  ( 9 ^  =0 2 =<^ >  ' 

h(0,0)  =1.  Therefore,  L(0,0)  =  1-u  as  expected.  Similarly  when 
0  »>0  and  ®2=®2  actuai  parameters  equal  to  their  estimates),  then 

h  (0 1, 0  „  )  =  1  Therefore,  h  (0  ^ ,  0  ^ )  =  (3. 

Finally,  if  the  observations  and  X^2 (i  ^  1)  are  independent  and 
identical,  then  equation  (7.38)  is  as  follows 

E[Z./e  =9-]  =  2  E [Z  /0  ]  (7.41) 

1  1  2  rl  1 

Consequently,  it  follows  from  (7.41)  that  for  the  same  power  of  the  te3t, 

the  ASN  (0,0:0  =  0  )  is  exactly  half  the  ASNQ  in  the  case  of  one 

121  z  y 

sensor  (detector) .  However,  this  result  is  drawn  under  the  assumption  of 
no  excess  over  the  thresholds  which  is  true  only  for  vanishingly  small 
signal  strengths  <0 ^=6 2  ~  •  Moreover,  it  should  be  mentioned  that  when 

and  X^2  are  iid,  the  computation  of  the  OC  function  is  identical  to 
the  single  detector  case  as  given  by  (7.8). 


7.5.  Distributed  Conditional  Sequential  Sign  Test 

We  consider  generalizing  the  conditional  sequential  DZL  detector  of 
Shin  and  Kassam  [22]  to  the  distributed  system  of  two  sensors  3hown  in 
Fig.  7.1.  The  observations  are  as  defined  earlier  in  Section  7.4.  The 
only  difference  from  the  sign  detector  is  that  the  quantizers  are  now 
DZL's  as  defined  in  (7.12).  For  clarity,  we  rewrite  (7.12)  and  (7.13)  to 
obtain . 


f  +1  ,  if  x  >  c 
1  rk 


Y .  =  < 

rk 


0  ,  if  -c  £  x.  £  c  ,  k=l,2 
ik 

-1  ,  elsewhere 


(7.42) 
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and 


p9k  ■  PtfYik  '  *  l/V  • 1  -  Vc-V  (7-43a> 

%k  *  pt'Yik  -  - 1/9k>  -  V'-V  (7-43b) 


c8k  ■  Pr!Yik  ■  0/V  ■  1  ■  p9k  ■  %k  (7'43c> 


When  Hq  is  true  0^0,  02=O,  it  i3  clear  that  pQ1  =  qQ1  and  pQ2  =  qQ2 

for  all  X  £C,  with  actual  values  being  functions  of  the  actual  pdf's  as 
is  obvious  from  (7.43).  In  order  to  obtain  a  nonparametric  test,  we  need 
to  think  in  terms  of  conditional  tests.  To  this  end,  we  arrange  the 
observations  of  the  central  processor  into  three  groups  as  follows: 
i)  Group  1 :  contains  all  possible  combinations  of  Y^  and  Y  such 
that  Y^  =  ±  1  and  Y^2  **  ±  1.  In  this  group,  it  follows  from  the 
independence  assumption  that 

Pr  (Y . ,  =  T),  Y .  =  v/Y,,  =  ±  1  and  Y.„  -  ±  1)  = 

il  i2  il  i2 

(7.44) 


Pr  (Y .  =  T|/Y .  =  ±  1)  •  Pr  (Y  =  V /Y .  _  =  ±  1) 

il  il  i2  i2 

where  tj  =  ±1  and  v  =  ±1.  When  is  the  true  hypothesis,  equation 

(7.44)  reduces  to 

Pr(Yn  =  il,  Yi2  -  v/^il  =  ±  1  and  Yi2  -  ±  1)  =  1/4  (7.45) 

for  any  of  the  four  possible  events  in  the  group.  Similarly,  when  is 
true,  we  have 


Pr  (Y 


il 


Tl,  Y.„  =  v/Y..  =  ±  1  and  Y , .  =  ±  1)  = 

i2  il  i2 

Pr  (Y .  =  T|/H  )  Pr  (Y  =  v/H.) 
il  1  i2  1 


(7.46) 
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where  pg^  and  cjg^  are  the  estimated  values  of  the  parameters  pg^  and  qg^ 
as  defined  in  (7.43). 

ii)  Group  2 :  contains  all  possible  combinations  of  Y^  and  Y^2  such 
that  one  and  only  one  of  them  is  zero.  When  =  0,  we  obtain 

Pr(Y.  =  tl,  Y .  =  0/Y .  =  ±  1  and  Y.  =  0)  =  Pr(Y.  =  T\/l.,  =  ±  1)  (7.47) 

il  i2  ii  i2  ii  il 

Similarly,  when  Y  =  0,  we  obtain 

Pr (Y ,  =  0,  Y .  =  v/Y ,  =  0  and  Y.„  =  ±  l)=Pr(Y  =  v/Y.,  =  ±1)  (7.48) 

il  i2  il  i2  i2  x2 

It  is  interesting  to  observe  that  when  one  of  the  sensors  has  its  observa¬ 
tion  in  the  dead-zone  region  i.e.  IX.^I  <  c,  we  obtain  the  desired  condi¬ 
tional  distribution  by  simply  not  counting  that  sensor  (ignoring  its  cur¬ 

rent  observation).  From  the  results  of  the  single  detector  case  [22], 
the  test  is  still  nonparametric . 

iii)  Group  3 :  contains  the  observation  Y^  =  Y^2  =  0,  which  must  be 
ignored. 

From  (7.44)  -  (7.48),  it  follows  that  the  increments  by  which  the 

logarithm  of  the  likelihood  ratio  function  is  updated  depend  on  the 

random  variables  Y.,  and  Y._.  When  Y.„  and  Y._  are  in  group  1,  the 

il  i2  il  i2  ^ 

increment  is  a  random  variable  given  by 

2Pr (Y.  =  H/H.) 

.  il  1 

Z .  =  log  -  + 

1  p.  +  q. 

91  91 

(7.49) 

2Pr(Yi2  =  TJ/H  > 

log  -  when  Y.„  =  T|  and  Y._  =  V 

6  +  5  il  i2 

92  92 
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However,  when  Y^  =  0  and  Y^  =  v,  the  increment  is  given  by  the  second 
term  in  (7.49)  only,  and  when  Y^  =  TJ  and  Y^  =  0,  the  increment  is  given 
by  the  first  term  of  (7.49)  only.  Therefore,  the  observations  as  seen  at 
the  central  level  are  not  identical  at  all  stages  of  the  test.  Conse¬ 
quently,  it  is  difficult  if  not  impossible  [3]  to  solve  for  the 
ASN ( 0 ^ , 9 2 >  •  Moreover  the  computation  of  the  OC  function  is  also  diffi¬ 
cult,  because  upon  termination  we  do  not  know  the  exact  number  of  usable 
observations  obtained  by  any  sensor.  To  overcome  this  difficulty,  we 
assume  that  the  observations  are  identical  at  both  sensors,  and  propose 
to  perform  the  central  test  by  considering  Y .  first,  and  then  Y  ,  at 
all  stages  (i  >  1) .  In  this  case,  the  OC  function  is  the  same  as  for  the 
single  detector  case  previously  described.  The  ASN0  i3  given  by  the 
following  inequality 

\  ASNg  <  ASNQ  <S  |  [ASNj  +  1]  (7.50) 

where  ASN*  is  the  average  sample  number  for  a  single  detector  when  9  is 
the  actual  parameter. 


7.6.  Numerical  Results 

We  first  present  an  example  of  the  random  walk  formulation  of  the 
sequential  test  discussed  in  Section  7.3,  The  nominal  observation  model 
is  given  by: 

Hq  :  X  .  N(0,  a2) 

,  i  >  1 


H  :  X  _  N(9,  a2) 

where  N(0,  a* )  is  a  Gaussian  random  variable  with  mean  0  and  variance  02 , 
The  nominal  value  of  0  is  assumed  to  be  9  =  0.676  while  the  variance  ct2 
is  assumed  to  be  unity.  The  quantizer  nonlinearity  is  as  given  by  (7.12) 
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with  an  optimal  (22]  value  of  c  =  0.6,  which  yields  a  nominal  value  of  Pg 
given  by  Pg  =  0.84.  Using  Wald's  approximation  to  the  thresholds,  we 
obtain 

N-Z  5  In  (-^)  /  In  2 pi 
u  u 

and 

2  -  <!??  '  2  ln  2a-H' 

_3 

For  a  =  P  =  10  ,  we  obtain  N-Z  -  13.3  and  Z  =  12.12.  The  actual  values 

-3  -3 

required  to  satisfy  a  <  10  and  (3  <  10  at  the  nominal  value  of  0  are 
N  =  27  and  Z  =  12,  which  are  obtained  by  an  exact  solution  of  (7.24)  . 
Presented  in  Table  7.1  are  the  numerical  values  of  the  power  function 
P^fG)  and  the  average  sample  number  ASN{0)  obtained  for  some  value  of  0 
in  the  range  O<0<0,  i.e.,  0.5<p^<p^. 
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Table  7.1:  Performance  of  the  sequential  DZL  detector  when  implemented 


as  a  random  walk 


0 

P0 

V6> 

ASN  (9) 

0.676 

0.84 

0.99903 

45.62 

0.619 

0.82 

0.9974 

52.44 

0.567 

0.8 

0.9936 

60.95 

0.518 

0,7  8 

0.985 

71.7 

0.45 

0.75 

0.951 

93.38 

0.348 

0.7 

0.735 

140.33 

0.167 

0.6 

0.0481 

91.04 

0.1 

0.56 

0.0094 

63.12 

0.066 

0.54 

0.00404 

53.96 

0.034 

0.52 

0.00173 

46.94 

0.0 

0.5 

0.00073 

41.43 

Next,  we  present  some  numerical  results  for  the  nonparametric  sequen¬ 
tial  sign  test  described  in  Section  7.4.  The  observation  model  is  as 
follows : 


H 


H 


C 


1 


Xik 


X.. 

lk 


~  n (0,  a2) 

k 

-  "‘V*’ 


i  >  1;  k=l, 2 


where  N(0k,oj^)  is  as  defined  earlier.  The  nominal  design  values  are 

9  =  0.525,  9  =  0.824,  and  of  ■  1,  k=l,2.  The  resulting  values  of 

12k  0 

-3 

are  Pq^  *  0.7  and  p^  =  0.8.  The  required  value  of  a  is  set  at  a  -  10  , 
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while  the  power  of  the  test  is  required  to  be  PD^2.'®2^  =  at  the 

nominal  (estimated)  values  of  the  parameters. 

In  Table  7.2,  the  parameters  pQ_  and  pQ-  are  varied  such  that  their 

U  1  U  2. 

effects  on  the  power  of  the  test  are  identical.  In  other  words,  the  test 
has  the  same  power  as  if  it  is  performed  based  on  the  observations  of 
sensor  one  alone  or  sensor  two  alone.  In  this  case,  it  is  meaningful  to 


compare  the  resulting  ASN's  as  given  in  Table  7.2  below,  where  ASN^  de¬ 


notes  the  ASN  considering  only  the  kth  sensor  observations. 


Table  7.2.  Performance  of  the  distributed  nonparametric  sequential 
_  sign  test.  _  _  _ 


pei 

P02 

vw 

ASNei 

ASN02 

ASN(01,8  ) 

0.5 

0.5 

1X10-3 

79.07 

30.89 

22.21 

0.5103 

0.516 

2xl0~3 

87.62 

34.23 

24.62 

0.5309 

0.5486 

7.89X10"3 

111.44 

43.64 

31.36 

0.623 

0.6915 

0.8 

243.18 

97.89 

69.8 

0.6814 

0.7753 

0.996 

103 

43.24 

30.45 

0.7 

0.8 

0.999 

83.77 

35.76 

25.06 

0.744 

0.854 

0.99997 

57.77 

25.81 

17.84 

In  Table  7.3,  the  parameters  p^  and  Pq^  are  allowed  to  change  such 
that  their  effect  on  the  test  power  is  not  necessarily  the  same.  We 
denote  by  P  ,  k=l,2,  the  power  of  the  test  performed  based  on  the  obser 

U  K 

vations  of  the  kth  sensor  alone.  Observe  that  this  situation  is  not  en¬ 
countered  in  the  single  detector  case  [22]  where  we  have  a  single  parame- 
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ter  to  deal  with.  This  case  is  intended  to  demonstrate  the  effect  of  the 


parameter  changes  on  the  central  test  power. 


Table  7.3.  Performance  of  the  distributed  nonparametric  sequential 


sign  test  with  unequal  power  functions. 


P01 

P02 

PD1 

PD2 

VW 

ASN(0  ,0.  ) 

0.65 

0.85 

0.9633 

0.99996 

0.9997 

22.86 

0.75 

0.75 

0.99998 

0.9854 

0.9975 

27.7 

0.7 

0.85 

0.999 

0.999996 

0.9999 

20.05 

0.75 

0.8 

0.99998 

0.999 

0.9997 

21.75 

0.6 

0.7 

0.515 

0.857 

0.77 

72.18 

0.65 

0.8 

0.9633 

0.999 

0.9972 

29.52 

7.7.  Discussion 

The  nonparametric  sequential  conditional  sign  test  of  Shin  and  Ka3sam 
wa3  investigated  and  it  was  3hown  that  it  can  be  implemented  without  the 
table  look-up  operation  as  previously  required.  In  addition,  the  above 
test  was  formulated  as  a  random  walk  on  a  finite  number  of  states  and 
exact  analytical  expressions  were  derived  for  both  the  test  power  and 
average  sample  number  functions.  Both  the  sequential  sign  and  condi¬ 
tional  sign  nonparametric  tests  were  generalized  to  a  distributed  system 
of  two  sensors  and  shown  to  maintain  the  desired  nonparametric  property. 
It  was  shown  that  when  the  observations  of  the  two  sensors  are  iid,  the 
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distributed  system  requires  half  the  number  of  observations  required  by  a 
single  sensor.  Thus,  it  has  the  desirable  advantage  of  shorter  decision 
time  in  addition  to  the  well  known  advantages  of  distributed  detection 
systems  like  reliability  and  survivability. 
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CHAPTER  EIGHT 


SUMMARY  AND  SUGGESTIONS  FOR  FUTURE  RESEARCH 


8.1.  Summary 

In  this  report,  we  have  considered  some  sequential  hypothesis 
testing  problems  where  the  detection  network  consists  of  a  number  of 
local  sensors  and/or  detectors.  We  have  studied  a  centralized  SPRT 
based  on  quantized  mutlisensor  data,  and  shown  its  performance 
superiority  over  the  single  sensor  case.  Moreover,  we  have  studied  the 
issues  of  optimal  quantization  of  local  observations,  channel  errors, 
and  random  walk  formulation  of  the  central  SPRT. 

A  simple  multi-sensor  decentralized  sequential  detection  procedure  has 
been  investigated  when  no  explicit  fusion  rule  is  employed.  It  has  been 
shown  that  its  performance  improves  monotonically  with  the  number  of  local 
detectors.  Wext  a  distributed  sequential  detection  system  employing 
explicit  fusion  rule  has  been  considered. 

The  SPRT  of  Wald  has  been  generalized  to  a  distributed  system 
consisting  of  two  local  sequential  detectors  and  a  global  decision  maker. 
The  global  error  probabilities  are  derived  in  terms  of  the  local  error 
probabilities  and  the  fusion  rule  employed  at  the  global  decision  maker. 
Moreover,  the  global  ASN  is  obtained  in  terms  of  the  loc.'l  test  lengths 
and  the  fusion  rule.  In  addition,  we  have  generalized  the  Lee-Thomas 
MLGDS  detection  procedure  to  the  distributed  system  described  above.  The 
results  obtained  in  both  cases  show  the  performance  superiority  of  these 
distributed  tests  over  their  single  detector  counterparts. 
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Finally,  we  have  generalized  the  nonparametric  sequential  sign  and 
conditional  sequential  sign  tests  of  Shin  and  Kassam  to  a  distributed 
system  of  two  local  sensors.  We  have  shown  that  the  distributed  system 
exhibits  an  improved  performance  over  the  single  detector  case  and 
maintains  the  desired  nonparametric  property. 

8.2.  Suggestions  for  Future  Research 

Throughout  this  report,  we  have  assumed  that  the  observations  are 
statistically  independent  and  identically  distributed  at  the  local  sensors. 
However,  in  practice,  the  observations  can  be  dependent  both  spatially  and 
temporally.  Therefore,  one  fruitful  area  for  research  is  to  develop 
suitable  sequential  detection  schemes  under  the  appropriate  dependent 
observation  models. 

Another  possibility  is  to  investigate  sequential  detection  schemes  for 
different  network  structures  such  as  serial  and  tree  topologies. 

Appropriate  fusion  schemes  should  be  developed  and  system  performance 
should  be  evaluated. 
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